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EDITORIAL 


Big science ts hard but worth it 


ig science is hard. It is the throw-deep approach 
that pushes technology to the edge to achieve 
stunning breakthroughs that dramatically ex- 
tend the frontiers of science, while inspiring 
the next generation of scientists and capturing 
the imagination of the public. The Large Had- 
ron Collider (LHC) produced the Higgs boson, a 
particle that explains why all other particles have mass 
and whose name is now known around the world; the 
Atacama Large Millime- 
ter and Submillimeter Ar- 
ray (ALMA) revealed some 
of the earliest galaxies, as 
well as a nearby planetary 
system that has a striking 
resemblance to our own; 
and for 25 years, the Hubble 
Space Telescope has dazzled 
us with its discoveries and 
iconic astronomical images. 
But before they became fa- 
mous for discoveries, these 
and other big projects were 
infamous for their problems. 
What are the challenges for 
big science, and what does it 
take to succeed? 

Four hundred years ago, 
Galileo urged “measure what 
is measureable” and “make 
measureable that which is 
not.” Until recently, the latter almost always involved a 
single scientist or small group inventing new instrumen- 
tation (and often still does). But as science has matured, 
advances now often require big teams and expensive fa- 
cilities. The technological challenges are daunting, from 
operating a 100-ton, 27-km superfluid helium system at 
1.9 kelvin at the LHC to preparing for the unfolding of 
the James Webb Space Telescope’s (JWST’s) 18-segment, 
6.5-m mirror, 1.5 million km from Earth in 2018. Socio- 
logical, budgetary, organizational, and cultural issues can 
be even more difficult. Big-science time scales are longer, 
making it harder for graduate students to complete thesis 
work and for postdocs and assistant professors to achieve 
results that advance their careers in a timely way. Proj- 
ects are so large and expensive that proper management 
is complex, and even small budget overruns have broad 
programmatic impact. Big science often involves partner- 
ing, and whether it is multiple funding agencies, several 
countries, or a public/private partnership, getting part- 
ners with varying cultures to act coherently is difficult. 


ALMA’s astronomical discoveries exemplify the triumphs 
of big science projects 


“It is the throw-deep approach that 
pushes technology to the edge...” 


And yet big science can overcome these hurdles and 
be triumphant. From my experience at the U.S. National 
Science Foundation with big projects, including ALMA 
and the LHC, I see four key ingredients for success at this 
scale. All of the stakeholders, from the science commu- 
nity to the funders, must view the science as truly wor- 
thy of a big-science approach and must be committed to 
seeing things through during good times and bad. With 
one-of-a-kind, envelope-pushing projects, problems will 
undoubtedly arise, but with 
the appropriate manage- 
ment structure, indepen- 
dent oversight, and project 
contingency funds, they can 
be identified quickly and 
solved. Partners must be 
committed to the project 
and to their well-defined, 
agreed-upon __ responsibili- 
ties and willing to delegate 
authority to a managing 
partner or to central project 
management. Last, but not 
least, all involved must be 
honest and realistic about 
both progress and problems 
and willing to make hard 
decisions, whether it be de- 
scoping, raising additional 
funds, or even cancellation. 

Four years ago, the U.S. 
Congress came close to cancelling Hubble’s successor, 
the JWST. Although the details of how big projects suf- 
fer near-death experiences vary, JWST’s problems were 
not atypical: The budget had gotten out of control, 
the management structure was insufficient, and there 
was an all-around failure to acknowledge difficulties. 
Thankfully, the science community—not just astrono- 
mers—recognized the importance of game-changing 
projects such as JWST and rallied to its support. NASA 
made major management changes, and Congress com- 
mitted to the needed funding, despite tough federal 
budgets. Today, JWST still faces substantial technical 
challenges as testing and systems integration begin, 
but it is on schedule and on budget with appropriate 
resources to address contingencies. 

When we see the stunning images from JWST and 
hear about the breakthroughs, we will all be proud of 
what was accomplished and know that it was worth the 
hard work and big investment. 

- Michael S. Turner 
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Sikuliaq ready for duty 


The RV Sikuliaq in the Bering Sea during ice and science trials. 


he rumbling of the RV Sikuliaq’s engine was music to ocean 
scientists’ ears this month, during a 23-day cruise to test how 
the newest addition to the U.S. oceanographic fleet handled icy 
seas. Starting from the island of Amaknak in Alaska, the ship 
crunched north into so-called ten-tenths sea ice—the name 
shiphands give to a sea ice coating that stretches to the hori- 
zon. The 80-meter-long, $200 million Sikuliag is not an icebreaker, 
but its hardened hull is rated to move through solid sea ice as thick 
as 0.8 meters—a rating that will allow scientists on board to access 
icy areas during the fall and spring, such as the southern parts of 
the Bering Sea. The Sikuliaq easily passed its various trials: Winches 
delivered sampling equipment in ice and water, and crews exited 
the ship on foot to take samples from surrounding ice. “She’s a great 
ship, and a great addition to the U.S. science fleet,” says chief scien- 
tist Carin Ashjian, a biological oceanographer from the Woods Hole 
Oceanographic Institution in Massachusetts. Sikuliag begins full re- 
search operations in ice later this year. http://scim.ag/Sikuliaq 
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39.6% 


Fraction of foreign students in U.S. science and engineering graduate programs 
in 2013. That all-time record is due both to increases in temporary visas and declining 
domestic enrollment, according to the National Science Foundation. 


AROUND THE WORLD 


Gene editing patent fight 


OAKLAND, CALIFORNIA | The dispute over 
who owns the rights to a groundbreaking 
and lucrative gene-editing technique has 
reached the U.S. Patent and Trademark 
Office. The University of California (UC) 

last week asked the office to reconsider 

10 patents awarded to synthetic biolo- 

gist Feng Zhang of the Broad Institute in 
Cambridge, Massachusetts, for a method 

of precisely cutting or replacing letters in 

a DNA sequence that could lead to new 
treatments for genetic diseases. UC Berkeley 
biochemist Jennifer Doudna and collabora- 
tors first described the technique, often 
called CRISPR-Cas9, in a 2012 Science paper, 
but Zhang won the patents last year by sub- 
mitting notebooks that suggest his discovery 
predated Doudna’s. In what could be a long 
and costly battle, both sides will present 
evidence for the dates of their findings to a 
board of patent examiners. 


Rubber plants imperil biodiversity 


XISHUANGBANNA, CHINA | A growing 
demand for rubber products—particularly 
tires—has fueled the conversion of more 
than 2 million hectares of forests and 
farms worldwide into rubber plantations 
in the past decade. That could exacerbate 
the extinction crisis in southern China 
and other parts of Southeast Asia, new 
research in Conservation Letters suggests. 
The researchers found that conversion of 
forest to rubber monoculture decreases 
the number of bird, bat, and insect spe- 
cies. Additionally, pesticide, herbicide, and 
sediment runoff affects aquatic life, and 
the loss of smaller trees and shrubs leads 


Harvesting latex from the caoutchouc, or rubber tree. 
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to soil erosion and landslide risk. Some 
84% of the world’s existing 9.9 million 
hectares of rubber trees are in Southeast 
Asia, and global rubber consumption will 
likely grow 3.5% annually, the team found. 
But there may be hope: Sustainability 
certification schemes have reduced the 
negative impacts of oil palm and paper 
and pulp growing. A similar effort for 
rubber, the Sustainable Natural Rubber 
Initiative, launched its pilot phase in 
January. http://scim.ag/rubberplants 


Anger over “consensus center’ 


PERTH, AUSTRALIA | The Australian gov- 
ernment has again angered the country’s 
scientists by announcing it will contribute 
$4 million toward a new center to be built 
at the University of Western Australia’s 
Business School in collaboration with 

the Copenhagen Consensus Center, a 
Massachusetts-based think tank created 
and directed by controversial environmen- 
tal author Bjorn Lomborg. Lomborg has 
crossed swords with scientists for his views 
on mitigating climate change. “In the face of 
deep [funding] cuts to ... scientific research 
organisations, it’s an insult to Australia’s 
scientific community,” said Australia’s 
Climate Council, a nonprofit science and 
outreach center established by scientists 
after the Australian government shuttered 
its climate change commission in 2013. The 
Australian Consensus Centre will focus on 
the economic implications of agriculture, 
aid, and global development, according to a 
university official, but not climate change. 


NEWSMAKERS 


Three Q’s 


In March of 1970, forestry graduate 
student Doug Scott of the University of 
Michigan helped create a massive, 5-day 
“Teach-in for the Environment.” The 
teach-in was a precursor to the first Earth 
Day on 22 April, an environmental activ- 
ism event founded by Wisconsin Senator 
Gaylord Nelson that drew 20 million 
people across the United States. 


Q: Ecologist Barry Commoner, once 

called the “Paul Revere of ecology,” was 
there; what role did he play? 

A: Our kickoff event ... overflowed a 
14,000-person basketball arena. The event 
started with the cast of Hair and had 
Nelson and the governor of Michigan. 
Commoner was the central speaker. [But] 
he was a college professor, not used to 
speaking in front of the hot lights where 
you couldn’t see the audience. 
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Festival aims to make math fun 


igh-fives and wild laughter were plentiful at the United States’ first National 

Math Festival. The festival, held on 18 April in Washington, D.C., aims to help kids 

and adults see the beauty and wonder of math, says David Eisenbud, director 

of the Mathematical Sciences Research Institute (MSRI) in Berkeley, California. 

“People don’t really understand what mathematics is about, and if they under- 
stood it they would all like it as much as | do.” MSRI co-organized the festival with the 
Institute for Advanced Study and in collaboration with the Smithsonian Institution. 
Children made Mobius strips, gaped at mathematical card tricks, and clutched balloon 
octahedra in lieu of balloon animals. In a race called the “Oobleck Olympics,” teams 
competed to pour water out of jugs—sped up by swirling the bottle to create a vortex, 
as shown—while cornstarch and water on a speaker danced to the beat of music. 


Q: What happened during the teach-in? 

A: We basically took over the campus for 
5 days. Hundreds of people turned out to 
help organize. Professors devoted class 
time to the topic of the environment. 


Q: Why did science students lead the charge? 
A: I suppose we were paying more attention 
to the impact that man was having on the 
environment. But the whole university 

got involved. The law school hosted a 2-day 
symposium about what was then the 
cutting-edge topic of environmental law. You 
had to be pretty dedicated to your studies to 
not know what was going on and join in. 


Science funding head exits 


The head of Portugal’s science funding 
agency, biomedical researcher Miguel 
Seabra, stepped down this month amid 
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mounting criticism of his agency’s 
policies. The Foundation for Science and 
Technology (FCT) angered researchers in 
January 2014 when it announced a sharp 
drop in state-funded Ph.D. and postdoc- 
toral fellowships. Seabra also oversaw a 
controversial evaluation of the country’s 
R&D units, announcing in June 2014 that 
22% of the 322 evaluated units would 
lose their funding due to poor ratings, 
and another 26% would see their budgets 
reduced to “core funding.” Critics slammed 
the evaluation process as neither robust 
nor transparent. Crystallographer Maria 
Arménia Carrondo, a former adviser to 
FCT’s board, will take over from Seabra, 
the Ministry of Education and Science 
announced last week, but some fear her 
close connection to Seabra means that no 
drastic change is likely to happen. 
http://scim.ag/Seabra 
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Seeking tests for a contested brain disease 
FDA warning to UCLA highlights struggle to diagnose chronic traumatic encephalopathy 


By Emily Underwood 


ony Dorsett was always quick on his 

feet, nimbly evading many crushing 

collisions as a star running back for 

the Dallas Cowboys. Still, the former 

National Football League (NFL) player 

sustained numerous concussions dur- 
ing his 12-year career. Now, Dorsett believes 
he is paying for those hits. 

Last year, several news outlets reported 
that after having his brain scanned at the 
University of California, Los Angeles (UCLA), 
Dorsett was told he shows signs of chronic 
traumatic encephalopathy (CTE), a neurode- 
generative disease associated with repeated 
blows to the head. Along with several other 
former pro football players scanned at UCLA, 
the 61-year-old now attributes his failing 
memory and mood swings to CTE. 

Dorsett’s bombshell shocked the sports 
world—and alarmed many CTE researchers. 
Despite a widespread belief that the brain 
disorder is common among athletes in high- 
impact sports, there is no proven method 
of diagnosing CTE while a patient is alive, 
says Samuel Gandy, a neurologist at Mount 
Sinai Hospital in New York City. The medical 
literature lists only about 100 proven cases, 
all based on postmortem analysis of brain 
tissue. The scarcity of evidence, Gandy says, 
has fueled “controversy over what CTE is, 
and if it exists.” Meanwhile, high-stakes law- 
suits, including ones filed by former players 
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against the NFL, have added to the pressure 
on researchers such as Gandy, who are strug- 
gling to come up with methods for diagnos- 
ing and tracking the disorder. 

Last week, when CTE researchers met at a 
traumatic brain injury conference in Wash- 
ington, D.C., to take stock of their fledgling 
field, they discussed some tantalizing leads. 
But hallway chatter centered on a new con- 
troversy. As first reported by the Los Angeles 
Times on 10 April, the U.S. Food and Drug 
Administration (FDA) in February ordered 


“There are no diagnostic 
criteria for CTE in vivo.” 


Douglas Smith, University of Pennsylvania 


the doctor who delivered the bad news to 
Dorsett, UCLA psychiatrist Gary Small, to 
remove promotional language from the 
website of a company, TauMark, which has 
licensed his and colleagues’ research into 
imaging a protein called tau. The company’s 
site once displayed the slogan “Better Brain 
Diagnostics” and claimed that its PET scans 
could detect signs of CTE in living people. 
But those at the D.C. meeting agreed with 
FDA’s conclusion that the company’s mes- 
sage was misleading. “There are no diag- 
nostic criteria for CTE in vivo,’ declared 
neurologist Douglas Smith of the University 
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of Pennsylvania. “We need to clear the air” 

Only in the past month or so have re- 
searchers arrived at a consensus about what 
CTE looks like in postmortem brain tissue, 
Robert Stern, a neurologist at Boston Univer- 
sity, told meeting attendees. Last month, eight 
neuropathologists convened to examine digi- 
tal images of brain slices taken from people 
who had had a variety of neurodegenerative 
conditions, including Alzheimer’s disease 
and suspected CTE. Blind to each sample’s 
clinical diagnosis, the group identified two 
patterns that set CTE apart from other pa- 
thologies: clusters of a molecule called tau 
sequestered inside neurons that surround 
blood vessels in the brain; and clumps of tau 
in neurons and other brain cells at the bot- 
toms of sulci, the folds that make up a hu- 
man’s wrinkly cortex. The group’s findings, 
which will be presented at the American 
Academy of Neurology meeting in Washing- 
ton, D.C., this week, are the first strong dem- 
onstration that CTE “is indeed a pathological 
disease that is unique,’ Stern says. 

Small and his UCLA colleagues say they 
have found similarly located deposits of tau 
in living patients using their noninvasive 
technique. In a series of papers, including 
one online on 6 April in the Proceedings of 
the National Academy of Sciences (PNAS), 
they reported injecting former NFL athletes 
with a patented radioactive compound called 
FDDNP. The compound, originally designed 
to study B amyloid plaques in Alzheimer’s 
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Former running back Tony Dorsett (33) was 
reportedly told that his football career had likely 
given him a neurodegenerative disease. 


disease, binds to abnormal protein deposits 
in the brain and is visible under a PET scan. 
In the former players, it revealed a smat- 
tering of tau and amyloid protein deposits 
distributed in a pattern that resembles CTE 
pathology and can be reliably distinguished 
from the plaques and tangles found in Al- 
zheimer’s disease, the team reported. 

Stern is not convinced, saying that the pat- 
tern of FDDNP-labeled areas that the UCLA 
team saw in the PET scans “is not consistent 
with the neuropathological findings” that 


meeting, he presented preliminary PET data 
from former NFL players showing that T807, 
a compound that he says binds more spe- 
cifically to tau than FDDNP does, detected 
deposits of the protein in the brain’s corti- 
cal folds in a pattern similar to that seen 
in postmortem tissue. “I am confident that 
within the next five to ten years there will 
be highly accurate, clinically accepted, and 
FDA-approved methods to diagnose CTE 
during life,’ Stern wrote in an October 2014 
affidavit to the lawsuit filed against the NFL 
by retired players. 

The stakes are high for those players. 
Under the current settlement with the re- 
tired athletes, the NFL will compensate 


came out of the consen- only those diagnosed 
sus necting 1 Mh LLL with Alzheimer’s disease 
Because FDDNP binds Too much tau? or a “neurocognitive 


to so many different sub- 
stances in the brain, it is 
not an ideal compound 
for imaging tau’s presence, 
adds Patrick Bellgowan, 
a program director at 
the National Institute of 
Neurological Disorders 
and Stroke (NINDS) in 
Bethesda, Maryland. The 
UCLA group stands by its 
findings, however, saying 
that FDDNP does not need 
to bind exclusively to tau 
in order to provide a reli- 
able picture of CTE’s pro- 
gression and distinguish 
it from other neurodegen- 
erative diseases. 

The PNAS report also 
raised eyebrows because 
its authors include Rob- 
ert Fitzsimmons, a per- 
sonal injury lawyer who in 
1999 represented the late 
Hall of Fame center Mike 
Webster in a disability 
lawsuit against the NFL. 
Although Fitzsimmons is 
not involved in the class 
action suits, which involve 
more than 4000 former players, Webster was 
the first NFL player to be diagnosed with 
CTE after his death, and many credit his case 
with launching the current suit. In a state- 
ment, the UCLA team said that Fitzsimmons, 
who is a director at TauMark, “had signifi- 
cantly contributed to the design of this study 
because of his broad experience with concus- 
sions and brain damage.” (He and two other 
co-authors on the PNAS paper founded the 
Brain Injury Research Institute in 1996.) 

Despite the field’s bumpy progress, Stern 
is convinced that diagnostic tools for CTE 
in living people are within reach. At the 
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Brain scans of former NFL players 
highlight a disputed tag (red) for 
protein deposits. 


disorder,’ Stern says. A 
player who could receive 
$1 million if he has an 
Alzheimer’s diagnosis, 
for example, might re- 
ceive less than half that, 
or nothing all, because 
CTE can’t yet be defini- 
tively diagnosed. The 
settlement is now in final 
negotiations, but as writ- 
ten it can be revised to in- 
corporate new diagnostic 
criteria for CTE only ev- 
ery 65 years, Stern says. 

The rush to find new 
diagnostic tools makes 
it easy to forget that re- 
search into CTE “is just at 
the starting line,’ Smith 
cautions. Tau deposits 
may only be a shadow or 
aftereffect of head injury, 
and not a cause of symp- 
toms, for example. Fun- 
damentally, Smith says, 
“we need to explore what 
shifts you from a normal 
aging track to a neuro- 
degenerative track.” 

To tackle that ques- 
tion, scientists must follow large groups of 
people with concussions and other head in- 
juries, ideally until they die and their brain 
tissue can be examined, Bellgowan says. 
Research groups funded by the NFL and 
NINDS are already looking for markers of 
CTE in blood samples and brain tissue from 
thousands of people enrolled in an ongoing 
study funded by the National Institute on 
Aging, he says. And Bellgowan adds that 
NINDS is reviewing a fresh round of grant 
proposals aimed at detecting CTE and de- 
fining its progression. The clock is ticking 
on Stern’s prediction. @ 
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Plan for E.U. 
research funds 
raises ire 


European Parliament vows 
to shield science budget 
from stimulus package raid 


By Tania Rabesandratana 


battle has erupted in Brussels over 

the European Commission’s plan 

to raid research funds in a bid to 

boost Europe’s lagging economy. 

Announced in November, the plan 

involves diverting €2.7 billion from 
the European Union’s 2014 to 2020 re- 
search budget to create a new E.U. invest- 
ment fund. Member states like the idea, 
but scientists protested—and now the Eu- 
ropean Parliament appears to have heard 
them. Earlier this week, it voted to oppose 
raiding the research budget, setting the 
stage for lengthy negotiations with the 
Council of Ministers, which represents 
member states. 

European Commission chief Jean-Claude 
Juncker and Carlos Moedas, the European 
Union’s research commissioner, have in- 
sisted that researchers have no cause 
for alarm. First, the commission says the 
money diverted for the investment fund, 
known as the European Fund for Strategic 
Investments (EFSD, represents “only 3.5%” 
of the overall budget of Horizon 2020, the 
European Union’s 7-year research fund- 
ing plan. Second, the commission claims 
that the money will not be lost to science: 
“On the contrary, this is money that will 
be used to attract much more important 
sums [from national governments and pri- 
vate investors] that will then be reinvested 
in innovation,” the commission said in 
a statement. 

Scientists and research organizations 
don’t buy this argument. Universities won’t 
be able to use the money that is diverted, 
says the European University Association: 
Instead of supporting research grants, the 
funds would become seed capital for loans 
that many public organizations cannot use 
because they are not allowed to borrow 
money. Scientists are particularly incensed 
that the European Research Council, which 
distributes individual grants for funda- 
mental research, would lose €221 million. 
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In February, a group of Nobel laureates 
urged Juncker to reverse what they termed 
a “misguided and short sighted policy.” 
Cutting Horizon 2020 funds “send[s] a 
message that Europe is not the place to do 
high level science,” they wrote. 

Juncker’s plan requires the approval 
of both the Parliament and the Coun- 
cil of Ministers. However, in a vote on 
20 April, members of the European Parlia- 
ment (MEPs) agreed to set up EFSI but op- 
posed devoting Horizon 2020 funds to it. 
The Parliament “appears as the only E.U. 
institution defending Horizon 2020,” said 
Kurt Deketelaere, secretary-general of the 
League of European Research Universities, 
in a statement. 

“Tt is completely unnecessary to trans- 
fer money from Horizon 2020, says 
Kathleen van Brempt, a Belgian socio- 
democrat MEP who sits on the Parliament’s 
industry, research, and energy committee. 
Van Brempt instead wants EFSI to use un- 
spent E.U. funds that are normally returned 
to national governments at the end of each 
year. “There are large enough leftovers to pay 
for the [EFSI] guarantee fund,” which the 
Parliament could watch over year after year, 
she says. However, national finance minis- 
ters like their annual E.U. refund, and so will 
certainly oppose van Brempt’s plan during 
the coming weeks of “trilogue” negotiations 
between the commission, Parliament, and 
member states. Juncker said he wants to fi- 
nalize the plan before the summer. 

Although the Parliament has often flexed 
its muscles this way since its powers were 
extended in 2009, the Council of Ministers 
usually has the upper hand in negotiations 
over the E.U. budget. “We can expect weeks 
of high political games,’ Deketelaere says. 
“The question is who is going to give in.” 
The Parliament is showing a united front 
to protect research funds from cuts, but “I 
don’t know if that will hold until the end,” 
he adds. & 


European Commission chief Jean-Claude 
Juncker says scientists shouldn’t worry. 


380 24 APRIL 2015 + VOL 348 ISSUE 6233 


RESEARCH FUNDING 


House science chief unveils 
contentious vision for science 


Authorization bill would cut climate and energy research 
and reshape science agency policies 


By Jeffrey Mervis and Adrian Cho 


epresentative Lamar Smith (R-TX) 
has never hidden his desire to re- 
shape federal research policy— 
often over the objections of much 
of the scientific 
community—since 
he became chair of the 
House of Representa- 
tives science committee 
2 years ago. Last week, 
he introduced legislation 
that lays out those plans 
in unprecedented detail, 
and the reaction was pre- 
dictable. Although aca- 
demic leaders say that 
some parts of the new, 
189-page bill are better 
than previous versions, 
they believe it would se- 
riously damage the U.S. 
research enterprise. 

The bill not only sets 
out funding levels for several research agen- 
cies that in some cases depart sharply from 
those the Obama administration requested 
for 2016; it would also reshape key policies 
and priorities guiding those agencies. In par- 
ticular, researchers complain that the bill 
(H.R. 1806), called the America COMPETES 
Reauthorization Act of 2015, would: 


« Narrow the scope of research at the 
National Science Foundation (NSF) by des- 
ignating some scientific disciplines as 
more important to the nation than others; 

¢ Sharply reduce NSF’s authority to fund 
the social sciences and the geosciences; 

¢ Restrict NSF’s ability to build large new 
scientific facilities by requiring the 
agency to follow new, controversial, ac- 
counting practices; 

¢ Curtail climate change research at the 
Department of Energy (DOE); 

¢ Block the government from using DOE 
research findings in writing regulations; 

« And squeeze the budgets for DOE’s ap- 
plied research program and its fledgling 
Advanced Research Projects Agency- 
Energy (ARPA-E). 
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be affected by proposed geoscience cuts. 


Smith doesn’t have the authority to impose 
that vision on Congress. The Senate has yet 
to draft a matching authorization bill, and 
the path to congressional passage is uncer- 
tain. But his committee's oversight of several 
key scientific agencies means his ideas will 
play an important role in 
the debate. 

The first America 
COMPETES Act, passed 
in 2007 and renewed in 
2010, enjoyed biparti- 
san support from presi- 
dents George W. Bush 
and Barack Obama. In 
contrast, the new ver- 
sion has no Democratic 
co-sponsors, was not 
shown to Democratic 
committee members be- 
fore it was unveiled, and 
has triggered a bitter 
partisan debate. 

“The original Ameri- 
can COMPETES Act was 
one of the crowning achievements of the 
science committee,’ says Representative 
Eddie Bernice Johnson (D-TX), the top- 
ranked Democrat on the panel. “This bill is 
an America COMPETES bill in name only. It 
does nothing to further our scientific and in- 
novation enterprise.” 

Smith disagrees, saying the legislation “re- 
establishes the federal government’s primary 
scientific role to fund basic research [and] 
prioritizes taxpayer investments.” In offering 
additional spending—often at odds with what 
the administration has requested (see table, 
p. 381)—he cites the country’s need to catch 
up in supercomputing and particle physics 
and preserve its lead in other areas. Smith 
offsets those increases by cutting “later-stage” 
technology and commercialization programs 
that he believes “are more effectively pursued 
by the private sector” The bill authorizes 
spending levels for the 2016 and 2017 fiscal 
years, with no increases in the second year. 

Some science advocates object to Smith’s 
assertion that legislators can do a better job 
than the scientific community itself in identi- 
fying the most promising research areas. They 
are angry that he has proposed funding levels 
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for each of NSF’s seven direc- 
torates rather than giving 
the agency a top-level num- 
ber and allowing it to distrib- 
ute funding as appropriate. 
They are angrier still that 


COMPETEing visions for U.S. research 


A bill drafted by House Republicans matches White House spending 
priorities for science in some areas but diverges greatly in others. 


2016 funding levels, in $ billions 


Washington, D.C. But that 
assumption is unrealistic, 
he says. A 2008 APS study 
showed “that some federal 
money had to be provided 
because nobody else would 


he wants to cut more than ' oMAEuEs do it,” he notes. 
$100 million from two of mee cael eaare Smith and his Republican 
those research directorates— colleagues do back govern- 
geosciences and social sci- COMPETES would boost: ment support for early-stage 
ences—shrinking the latter Dele aban 0.420 0.488 16% technologies outside the en- 
by more than half. ergy arena. Last December, 
“T think it’s ironic that the SF biology 0.748 0.835 12% in the final 2015 spending 
science committee would : , bill for all federal agencies, 
mark up this bill on Earth pe adie nee ue Smith championed a net- 
Day,’ says Sherri Goodman, SF computer science 0.954 1.050 10% work of manufacturing in- 
president of the Consortium : ; novation research centers, 
for Ocean Leadership in SI OSE pelea col eerre ase’ EDDe us to be run by the National 
Washington, D.C., referring COMPETES would reduce: Institute of Standards and 
to a meeting this oS IST science 0755 0745 1% Technology, that would 
day of the panel. “Before tap money now going to 
they take such a drastic SF overall 7723 7597 -2% DOE’s EERE. The original 
step, I hope they reconsider : plan called for spending 
the adverse consequences it Stecieaan OMe 0.866 _ $250 million over a decade 
would have to the environ- DOE bio/environment 0.612 0.550 -10% on several new centers. But 
ment, the economy, and na- ; the COMPETES reauthori- 
tional security.” oases elles eeo Megs See zation would speed up that 
The bill also targets NSF’s IST 1.120 0.934 -17% timetable, making $150 mil- 
oversight of big new scien- = lion of the total available 
tific facilities. Smith and DEE tevew ales emicieusy BEE Hee See over the next 3 years. 
other Republicans believe ARPA-E 0.325 0.140 57% Research funding lev- 
that NSF has been lax in this ; ; els aren’t all that troubles 
regard, and the bill requires SF social/behavioral* 0.237 0.100 798% Lubell and other science 
the agency to “correct” any No change: advocates. The bill also pro- 
problems identified by an claims that “the results of 
independent audit of a proj- SF research account 6.186 6.186 0% any research, development, 
ect’s expected cost before DOE Office of Science 5.340 5.340 0% demonstration, or commer- 
starting construction. That cial application projects or 
language could seriously DOE advanced computing 0.621 0.621 0% activities of the [Energy] 
delay new projects, say NSF DOE Basic Energy Sciences 1.849 1.850 0% Department may not be 
officials, who add that the used for regulatory assess- 
bill also contains rules about DOE High Energy Physics 0.788 0.788 0% ments or determinations by 
the use of contingency funds DOE Nuclear Physics 0.625 0.625 0% Federal regulatory authori- 


that are at odds with existing 
federal policies. 

Within DOE’s Office of 
Science, the bDill’s impact 
would be limited to two of the office’s six re- 
search programs. It would move more than 
$60 million from biological and environ- 
mental research into fusion research. That 
money would presumably come out of DOE 
climate change efforts, as the bill would re- 
quire DOE to eliminate any climate research 
deemed to overlap with what other federal 
agencies are doing. 

When it comes to DOE’s applied research 
efforts, however, the new COMPETES act 
takes out a hatchet. It calls for cutting spend- 
ing on DOE’s energy efficiency and renew- 
able energy (EERE) program by 37% from its 
current level, to $1.2 billion in 2016. In con- 
trast, the White House has requested a 42% 
boost, to $2.7 billion. Similarly, it would slash 
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the budget for ARPA-E, devoted to translat- 
ing the best results from basic research to 
budding energy technologies, by 50%, to 
$140 million, rather than increase it by the 
16% the administration has sought. The 
House bill also zeros out numerous smaller 
applied research projects such as DOE’s next 
generation lighting initiative, building stan- 
dards program, and efforts to find additional 
uses for electric-car batteries. 

Those changes are in line with the argu- 
ment by many Republicans that private in- 
dustry, not the federal government, should 
pick up the tab for translating the most- 
promising basic research into commercial 
technologies, says Michael Lubell, a lobbyist 
with the American Physical Society (APS) in 
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ties.” That clause, although 
it appears within a section 
on fossil energy research 
and development, would 
prevent the federal government from us- 
ing any of the DOE research it paid for to 
inform policy. “That’s absolutely bizarre,” 
Lubell says. 

The committee was scheduled to take 
up the bill on Wednesday, and Democrats 
are expected to offer a raft of amendments. 
Although few, if any, are likely to pass, sci- 
ence advocates are hoping that the Obama 
administration will object to enough of 
the bill’s provisions to reject anything that 
manages to clear both houses of Congress. 
“At this point, this White House is poised to 
veto anything that threatens the president’s 
priorities,” Lubell says. “I don’t really think 
there’s a snowball’s chance in hell of getting 
the president’s signature on it.” & 
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MEDICAL RESEARCH 


Clinical trials get practical 


Many clinical trials don’t help doctors make decisions. 
A new breed of studies aims to change that 


By Jennifer Couzin-Frankel, in 
Philadelphia, Pennsylvania 


ith billions of dollars spent each 

year to test new drugs and de- 

vices, you would think clinical 

trials would help doctors treat 

the patient in front of them. But 

you would be wrong, says Robert 
Califf, a cardiologist at the U.S. Food and 
Drug Administration (FDA) in Silver Spring, 
Maryland. He highlights one salient ex- 
ample: Along with colleagues, Califf found 
that only 11% of the hundreds of guidelines 
for heart health were based on evidence 
from multiple randomized trials or meta- 
analyses. Many trials are too small or too 
poorly designed to tell us much, Califf says. 
Others don’t address what doctors need to 
know, such as how one particular treatment 
stacks up against another. And even the big- 
gest and best clinical trials tightly restrict 
who can sign up, casting doubt on their rel- 
evance to broad patient populations. 

Last week, 200 statisticians, scientists, 
and physicians gathered at the University 
of Pennsylvania to hash out some options 
for repair. A leading remedy is a new breed 
of experiments: pragmatic clinical trials, 
which focus less on disease biology and 
more on helping doctors. They do not hew 
to a single design. Rather, pragmatic tri- 
als are guided by their end goal: informing 
practice. They might take all comers, focus 
on specific questions doctors want to an- 
swer, or streamline data collection to make 
an enormous trial more feasible. “These 
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trials will involve more women, more mi- 
norities, a range of incomes,” says Monique 
Anderson, a cardiologist at Duke University 
in Durham, North Carolina. 

But any new path raises questions, as last 
week’s meeting made clear. They include 
how to approach informed consent ethically 
when trying to capture a broader swath of 
the population, whether the quality of data 
from sources like electronic health records is 
sufficient, and how easily results from these 
trials can be understood. 

Broad enrollment is a key feature of 
pragmatic trials. Traditional trials tend to 
have strict admissions criteria, because 
drug companies believe the resulting data 
will be more precise, the drug will perform 
better, and it will be more likely to be ap- 
proved. But in the real world, all sorts of 
patients take medication. “If you come into 
my emergency department, I can’t say, ‘You 
have one of these six things, I can’t treat 
you, ” says Roger Lewis, an emergency 
medicine physician at the University of 
California, Los Angeles, who studies in- 
novative trial designs. Once a treatment 
reaches a more diverse population—people 
with, say, diabetes or high blood pressure or 
obesity—there are often surprises about its 
safety and effectiveness. 

In February, the Patient-Centered Out- 
comes Research Institute in Washington, 
D.C., announced $64 million in funding for 
five pragmatic trials. And in 2012, the Na- 
tional Institutes of Health established the 
“Collaboratory” to study how pragmatic trials 
might play out. For now, pragmatic studies 
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One pragmatic clinical trial compares different 
approaches to dialysis. Studies like this will enroll a 
broader cohort, including more women and minorities. 


tend to focus on health behaviors or compare 
available treatments, not test experimental 
drugs, although that could change. 

Nine Collaboratory trials are under way. 
One tests whether patients on dialysis are 
more likely to survive and stay healthier 
if the dialysis treatment itself lasts longer. 
The study is randomizing about 400 dialysis 
centers around the country to either con- 
tinue with their usual routine—dialysis typi- 
cally ranges from about 3 to 5 hours in the 
United States—or administer it for at least 
4.25 hours. Patients receive information 
about the trial at their clinic and a toll-free 
number to call if they have questions for the 
research team or wish to opt out. 

An opt-out model is an option only for 
some of the lowest risk clinical trials: U.S. 
regulations require active informed consent 
for studies of experimental drugs. Because 
current pragmatic trials are comparing ap- 
proaches doctors already use routinely, even 
ethicists agree that enrolling everyone, un- 
less someone objects, is often reasonable. 

Other challenges come in figuring out 
the best way to design pragmatic studies, 
interpret the results, and consider how 
much uncertainty is tolerable. For exam- 
ple, the Women’s Health Initiative (WHD, 
which reported on randomized trials of 
hormone replacement therapy more than 
a decade ago, is now running a pragmatic 
trial that relies on Medicare data to tell it 
whether 26,000 women who receive mail- 
ings and coaching to encourage exercise 
are less likely to develop heart problems. 
It’s a study so large that funding it would 
likely have been a stretch without plugging 
into health records. But there are disadvan- 
tages, too, explained Garnet Anderson, one 
of WHI’s leaders and a biostatistician at the 
Fred Hutchinson Cancer Research Center 
in Seattle, Washington, at last week’s meet- 
ing. Medicare data are designed for billing, 
not research, and the data have gaps. “What 
kind of errors can we tolerate?” she asked at 
the podium. “I don’t know.” 

Regulators still need to consider whether 
and how pragmatic trials could be used to 
test experimental drugs. Califf, who recently 
left Duke University to become a deputy com- 
missioner at FDA, says he plans to push for 
change. Although it’s hard to argue against 
more practical trials—and most don’t— 
companies may worry about retooling an en- 
trenched model, and some scientists might 
be concerned about too heavy a focus on 
pragmatism and too little on basic biology. 
“T intend at FDA to fight that battle,’ Califf 
says, “and see how successful I can be.” & 
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ECOLOGY 


Inbred wolf population on 
Isle Royale collapses 


But other wolves adopt a new Lake Superior island 


By Christine Mlot 


ach January, before they fly to snow- 
bound Isle Royale in Lake Superior, 
ecologists Rolf Peterson and John 
Vucetich guess how many wolves 
they'll spot. This U.S. national park in 
Michigan is home to the world’s longest 
running predator-prey study, of wolves and 
moose, but last year the wolves had dwindled 
to nine due to complications of inbreeding. 
This year, Peterson figured they’d likely find 
a mere seven wolves. But the island held only 
three, as the team from Michigan Techno- 
logical University in Houghton announced 
last week. “The collapse of the wolves was 
beyond our expectation,” Peterson says. 

The three wolves included a pair, probably 
the last known to have reproduced, plus a 
smaller potential pup. The other wolves are 
presumed to have either died or left the is- 
land last year, in a reverse of how carnivores 
originally came to Isle Royale, when a bit- 
ter winter froze the channel to the mainland 
(Science, 24 May 2013, p. 919). 

But even as the famed predator- 
prey study on Isle Royale ap- 
pears to be on its last legs, other 
researchers may have caught the 
birth of a similar natural experi- 
ment. Across the lake in Canada, 
three mainland wolves crossed the 
ice to a smaller island with differ- 


The fate of the island’s other wolves is 
hard to pin down. One male wore a radio 
collar and was spotted dead; his carcass will 
be collected this spring for autopsy. If all the 
others died, the 70% mortality rate for the 
year would be the highest in the study’s his- 
tory, Peterson says. So the wolves may have 
simply left via the icy corridor to the main- 
land, seeking unrelated mates. When the 
channel froze last year (for only the second 
time in 16 years) scientists learned later that 
a female identified by her collar crossed to 
the mainland and was shot. 

This year, an ice bridge formed again and 
for the first time scientists got a real-time 
view of how Isle Royale’s wolves arrived. 
Genetic analyses have revealed after the 
fact that such immigrations and resulting 
matings must have happened several times 
during the study’s 57 years. In February, 
Vucetich spotted two mainland wolves on 
the island. One, a female, fortuitously was 
fitted with a radio collar as part of a study 
by the Grand Portage, Minnesota, band of 


A tale of two islands 


The iconic predator-prey study on Isle Royale may be ending, as only 
three wolves remain, including a pair and what may be their 
malformed pup (last in line in photo). But scientists have started 
anew study on Michipicoten Island, home to caribou and a trio of 
wolves recently arrived from the mainland. 


Lake Superior Chippewa. The two visitors 
canvassed the southwest edge of Isle Royale 
for 5 days and then headed back to the 
mainland. The radio-collared Minnesota 
wolf next cruised two nearby small islands, 
perhaps seeking easier prey than a 400-kg 
moose. “That’s how [wolves] get informa- 
tion,” Peterson says. “They walk.” Peterson 
thinks the newcomers were aware of the 
resident wolves, but that mating would be 
unlikely given the few residents and the 
pair bond of two of them. 

Roughly 300 kilometers across the lake 
from Isle Royale, a trio of wolves might have 
found the Goldilocks option in the forests of 
184-square-kilometer Michipicoten Island 
Provincial Park in Canada. Three mainland 
wolves appear to have colonized the island 
via ice and have probably bred. Based on 
aerial surveys, Patterson estimates that 
Michipicoten holds 250 to 300 “predator- 
naive” woodland caribou, which are smaller 
than moose but bigger than deer. Work- 
ing for the Ontario Ministry of Natural 
Resources and Forestry, Patterson and his 
colleagues radio-collared all three wolves 
in February and are preparing for a po- 
tential Isle Royale redux, Patterson was to 
announce this week at the Midwest Wolf 
Stewards meeting in Ashland, Wisconsin. 
As on Isle Royale, where wolves curbed the 
moose population and so helped preserve 
the vegetation, “the wolves are expected to 
have a profound impact,” Patterson says. 

Back on Isle Royale, it’s no sur- 
prise that moose are booming. 
This year their numbers swelled 
by an estimated 200 to about 
1250, despite the harsh winter. 
Wolves are so scarce that their 
effect on moose has been essen- 
tially nil for the past 4 years, ac- 
cording to Peterson. 


ent prey and seem to have settled Geneticists including Philip 
in, population ecologist Brent 100 km Hedrick of Arizona State Univer- 
Patterson of Trent University in sity, Tempe, advocate introduc- 
Peterborough, Canada, was slated - Lake ing new breeding pairs of wolves 
to report at a meeting this week. aad Bena to Isle Royale. “These animals 

The wolves on Isle Royale once MINES OT Isle Royale Michipicoten might give the wolf population a 
numbered as many as 50. But they Island new start and the remaining two 
have been mostly isolated for gen- adults might eventually contrib- 
erations and the population has ute to this population,” he says. 
been overrun with spinal defects, Wee one MiChIORN But National Park Service offi- 
likely from inbreeding. The pos- cials say they plan to continue 
sible pup seen this winter also their hands-off stance for the mo- 
displayed an abnormally short ment. An environmental impact 
tail with raccoonlike stripes and a S analysis and request for public in- 
hunched back—perhaps due to the put on how to manage the wolves, 
lack of genetic diversity. “It [didn’t] moose, and forest is to launch 
look particularly healthy” in Feb- , this spring. For now, Isle Royale’s 

be 


ruary, Vucetich says, and may be 
dead by now. He and Peterson col- 
lected frozen wolf scat for DNA 
analyses to identify the trio. 
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few wolves roam alone. & 


Christine Mlot is a science writer 
based in Madison. 
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SCIENTIFIC PUBLISHING 


NIH’s peer review stands up to scrutiny 


Analyses show better scored proposals produce more papers and citations 


By Jeffrey Mervis 


he debate over whether peer review 

can pick out the research most wor- 

thy of funding has heated up in the 

past decade as competition for fed- 

eral dollars has become more intense. 

Two new studies support claims that 
peer review works at the National Institutes 
of Health (NIH). But some who follow the 
peer-review debate say the papers’ definition 
of success—three outcomes traditionally val- 
ued by the scientific community—ignores 
important factors, meaning that the debate 
is sure to continue. 

One study, on page 434, examined the 
outcomes of 137,215 NIH research project, 
or ROI, grants awarded between 1980 and 
2008. It found that grant proposals rated 
more highly by NIH study sections gener- 
ated more publications and more citations 
than those that received lower scores. A 
second study, published online this month 
in Research Policy, found that the addi- 
tional proposals funded after the agency 
received billions of dollars from the 2009 
economic stimulus package garnered fewer 
publications and citations than the grants 
initially funded. 

“Experts add value,’ says economist 
Danielle Li of the Harvard Business School 
in Boston, an author of the Science study. “It 
has something to do with their ability to see 
quality outcomes before they happen.” 

The head of NIH’s massive grant-review 
enterprise, Richard Nakamura, agrees that 
the research appears to bolster the case for 
enlisting thousands of scientists as review- 
ers. But the data are hardly definitive, he 
says. The Science paper “says that, unlike 
what other studies have found, there is a 
relationship between scores and outcome 
measure if you look at enough grants,” 
Nakamura says. “But it’s a very noisy 
measure. And the debate over how to mea- 
sure the outcome of grants remains very 
much alive.” 

The standard critique of peer review is 
that it works reasonably well in separating 
the wheat from the chaff, but that study sec- 
tion reviewers are less capable of making 
fine distinctions between two meritorious 
proposals. For the biomedical community 
served by NIH in particular, discontent 
with peer review has intensified over the 
past decade as success rates for project 
grants fell from one in three to nearly one 
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in six. The search for alternatives includes 
innovative ways of removing study sec- 
tions from the allocation of grant funding 
(Science, 7 February 2014, p. 598). 

Out of an endless number of possible out- 
come metrics with which to evaluate NIH’s 
current reviewing process, Li and Leila Agha, 
who is at Boston University, chose perhaps 
the most conventional. They opted for the 
number of papers generated by a funded 
grant, how often those papers 
were cited by others, and what 
papers were among the most 
cited of the year. Their analysis 
showed that a proposal with a 
score one standard deviation 
above a second proposal re- 
sulted in 8% more publications, 
17% more citations, and 24% 
more high-impact publications. 
(Factors such as an investiga- 
tor’s publication history, years 
since degree, and previous NIH 
funding cause the numbers to 
vary, but they remain statisti- 
cally significant.) 

The Research Policy study 
also gives NIH’s peer-review 
system a pat on the back. 
Researchers from the Geor- 
gia Institute of Technology in 
Atlanta and Drexel Univer- 
sity in Philadelphia treated 
2775 awards NIH made with 
stimulus funding as a natural 
experiment, comparing them 
with 9779 regular grants. 
They found that those grants 
made after NIH lowered the 
pay line—the score divid- 
ing funded from nonfunded 
proposals—generated fewer 
publications and _ citations 
than did projects that had sur- 
vived the initial cut. The obvi- 
ous implication: Reviewers knew what they 
were doing when they failed to fund the 
proposals the first time around. 

Neither paper distinguishes between de 
novo applications—some 56% of the grants 
that Li and Agha examined—and renewals 
to continue work that NIH is already fund- 
ing. That distinction is important, some 
scientists contend, because it’s much easier 
to judge the value of research with a track 
record. “Peer review works very well in as- 
sessing past and present performance,’ says 
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Proof positive for 


peer review? 

A one-standard devia- 
tion improvement in NIH 
study section scores 
among awarded grants 
is associated with rises 
in some outcomes. 


8% 


more publications 


a 


more citations 


ae 


more high-impact 
publications 


<7 


more follow-on patents 


Michael Lauer, head of cardiovascular sci- 
ence at NIH’s National Heart, Lung, and 
Blood Institute in Bethesda, Maryland, 
whose work has questioned his institute’s 
ability to pick the best research. “But it’s 
much less good at making predictions” 
about whether an investigator’s novel ap- 
proach to a problem will bear fruit, he adds. 
Lauer notes that some NIH institutes are pi- 
loting approaches that bet on people rather 
than projects, a strategy that 
may place less importance on 
publications and citations. 

Study section review scores 
are not the only way NIH de- 
cides how to invest scarce re- 
sources. Program managers 
must balance research port- 
folios across several fields, 
decide how large every award 
will be, and weigh whether 
funding someone with no 
other grants is likely to yield 
more science than adding to 
the pot of a well-heeled in- 
vestigator. To capture those 
factors, Lauer says, he prefers 
to use return on investment— 
citation impact per million 
dollars spent—as a metric for 
research outcomes. 

Nakamura says he worries 
about judging outcomes us- 
ing publications and citations, 
because journal editors and 
authors have _ considerable 
control over those variables. 
But he’s intrigued by another 
outcome metric in the Science 
paper that falls outside the 
traditional realm of academic 
science. Li and Agha found 
that the number of patents 
spawned by a funded grant 
correlated with proposal 
scores—better reviews led to more patents. 
The relationship is not direct, however: The 
researchers counted patents that cited pub- 
lications that in turn discuss other papers 
done under an NIH grant, not the grant 
that led to the patent. 

Lauer hopes these new results will help 
NIH improve peer review by relying on vet- 
ted research rather than on anecdotes or 
gut instincts. “Their work allows us to talk 
about those options by using data rather 
than opinion,” he says. & 
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By Daniel Clery 


t 25 years of age, the Hubble Space Telescope 

is still in its prime. Hubble’s instruments are 
fully functional, and the orbiting observatory 
keeps cranking out new results. “It’s at a peak of 
productivity,” says Paul Hertz, director of NASA’s 
astrophysics division. 

Astronomers have published more than 13,000 papers 
based on Hubble data—“more than any other telescope 
in history,’ says Matt Mountain, president of the As- 
sociation of Universities for Research in Astronomy 
in Washington, D.C., and former director of the Space 
Telescope Science Institute. It has helped measure the 
age and expansion of the universe, shown the ubiquity 
of supermassive black holes at the hearts of galaxies, 
watched comet Shoemaker-Levy 9 crash into Jupiter, 
and imaged some of the first galaxies. The cosmic photo 
album it has assembled includes some of the most 
memorable astronomical images ever. 

Today, Hubble continues to do what it has always done: 
tirelessly snapping pictures of everything from nearby 
asteroids to the most distant active galactic nucleus. The 
best ground-based telescopes, with mirrors four times as 
wide as Hubble's and adaptive optics to correct for Earth’s 
atmosphere, can now match Hubble’s resolution, but 
Hubble can apply its keen eye over a wider field of view 
and still see much fainter objects. And it’s never held up 
by clouds or sunrise. But what has really kept Hubble at 
the cutting edge is the five servicing missions by shuttle 
astronauts. Thanks to a series of repairs and upgrades— 
starting with a 1993 fix for its faulty mirror, which threat- 
ened to nip the whole mission in the bud—Hubble in 2015 
is a better telescope than the one launched 25 years ago. 

But all good things must end. With no space shuttle 
to service it, Hubble’s demise is expected sometime next 
decade. Its instruments could fail, or its gyroscopes could 
conk out, leaving the observatory adrift. NASA remains 
confident that Hubble will survive to overlap for at least 
a few years with its successor, the James Webb Space 
Telescope (JWST), due for launch in 2018. Estimates of 
Hubble’s orbital decay suggest that disposal, perhaps with 
the help of a remote-controlled spacecraft, won’t be neces- 
sary before the 2030s. 

As Hubble approaches those final years, researchers 
are turning their attention to what comes next. So far, the 
quest for a successor has faced rough going. JWST, the 
heir apparent, has been plagued by delays and cost over- 
runs, sapping funds and support from other missions (see 
main story, p. 388). But even without those constraints, 
Hubble would be a tough act to follow. m 


Jewels from Hubble’s trove. Clockwise from top left: Sombrero galaxy; 
newborn stars in N90 nebula; tower of cold gas and dust, Eagle nebula; 
hot gas jets, NGC 6302 (top); “Cat's Eye" nebula (bottom); pillar of gas 
and dust, Carina nebula. 

PHOTOS: (CLOCKWISE FROM TOP LEFT) NASA/ESA AND THE HUBBLE HERITAGE TEAM 
(STSCI/AURA); NASA, ESA, AND THE HUBBLE HERITAGE TEAM (STSCI/AURA)-ESA/HUBBLE 
COLLABORATION; NASA, ESA, AND THE HUBBLE HERITAGE TEAM (STSCI/AURA); NASA, ESA, 


AND THE HUBBLE SM4 ERO TEAM; NASA, ESA, HEIC, AND THE HUBBLE HERITAGE TEAM 
(STSCI/AURA); NASA, ESA, M. LIVIO, AND THE HUBBLE 20TH ANNIVERSARY TEAM (STSCI) 
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Engineer at NASA's Marshall Space Flight 
Center inspects primary mirror segments 
destined for the Webb Space Telescope. 


After Hubble 


The Webb telescope’s troubled history poses challenges for other 
contenders to replace the world’s most popular space telescope 


wenty years ago, when it was five, 
the Hubble Space Telescope took 
aim at a patch of sky so small it 
was almost empty of foreground 
stars and snapped the same image 
over and over for 10 days straight. 
When combined into a single de- 
tailed image, known as the Hubble 
Deep Field, the view transformed 
astronomers’ knowledge of the early uni- 
verse. The long total exposure, coupled with 
Hubble’s unmatched resolution, revealed 
3000 distant galaxies, some so red and faint 
that they dated back much of the way to the 
big bang. Hubble bettered the Deep Field in 
2004 with the Hubble Ultra Deep Field and 
again in 2012 with the Hubble eXtreme Deep 
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Field, which showed some galaxies as they 
were 13.2 billion years ago—just 600 million 
years after the birth of the universe. 

Now, as Hubble enters its final years, the 
astronomy community is wondering what 
instruments will deliver such revelations 
in the future. Hubble inaugurated an era 
of space-based Great Observatories, each 
probing a different part of the spectrum: 
the Compton Gamma Ray Observatory, the 
Chandra X-ray Observatory, and the infra- 
red Spitzer Space Telescope. Each delivered 
its own revelations, from gamma ray bursts 
in the distant universe to far solar systems 
taking shape; all are now aging or defunct. 
NASA's one big plan for a follow-up space 
observatory, the James Webb Space Tele- 
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scope (JWST), has survived a near-death 
experience and is now on track for a launch 
in 3 years—but at a cost so steep, during 
a time of stagnant government funding, 
that it has squeezed out other missions or 
pushed them farther into the future. 

“It’s been a struggle,” says Michael Turner, 
director of the Kavli Institute for Cosmolog- 
ical Physics at the University of Chicago in 
Illinois. “This decade will be remembered 
for amazing discoveries and growth in the 
field, but new missions have not kept up 
with opportunities.” As researchers begin 
to identify priorities for the next decade, 
they will have to make some tough deci- 
sions about where astronomy should focus 
its gaze. 
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IF ALL HAD GONE AS PLANNED, the Webb 
telescope would already be sending home its 
own dazzling views of the universe. In their 
2000 decadal survey, part of a regular prior- 
ity setting by U.S. astronomers, they rated 
the Next Generation Space Telescope—later 
named after former NASA Administrator 
James Webb—as their number one choice. 
JWST isn’t a direct successor to Hubble: 
Whereas Hubble is sensitive to visible wave- 
lengths plus small bands of ultraviolet and 
near-infrared, JWST ranges from orange and 
red visible light to mid-infrared. 

That window on the cosmos will allow 
JWST to scrutinize light from some of the 
very first stars and galaxies, which the ex- 
pansion of the universe has shifted into 
the infrared. Infrared light can also pen- 
etrate gas clouds that obscure areas where 
stars and galaxies form; some exoplanets 
and their atmospheres glow brightly in the 
infrared, as well. The Spitzer telescope, 
which ran out of liquid-helium coolant in 
2009 and is now largely blind, made similar 
studies but was limited by its modest 85- 
centimeter mirror. JWST will capture infra- 
red light with a segmented mirror 6.5 meters 
across. That increase “is an amazing leap 
from one generation to the next,’ says JWST 
Director Eric Smith. “JWST will look at [star- 
forming] clouds with Hubble-like sharpness 
of vision.” 

NASA initially estimated that JWST would 
cost $1.6 billion and reach orbit in 2011. 
Design and technology development work 
began, but in 2005, spiraling costs forced a 
replanning of the project that pushed the 
launch back to 2013 and hiked the life cycle 
cost to $4.5 billion, including hardware, 
launch, and 10 years’ operation. By 2010, the 
project was meeting all its technical goals, 
but schedule slips and budget overruns 
were building up again. An independent re- 
view criticized the project’s budgeting and 
management. In 2011, when NASA reported 
to Congress that the launch would likely slip 
to 2018 and the cost total more than $8 bil- 
lion, the House of Representatives appro- 
priations committee responsible for science 
voted to cancel the program. 

To save the mission, astronomers threw 
themselves into months of lobbying, letter 
writing, petition signing, and public out- 
reach. “Cancellation would have been a di- 
saster,’ Turner says. “Great nations do great 
things, and this is exactly what the United 
States should be doing.’ In a deal over the 
2012 budget, Congress restored JWST fund- 
ing but with conditions: Management of the 
project was reformed and the cost capped 
at $8 billion. In return, lawmakers prom- 
ised not to tinker with the budget each year. 
“They’ve kept up their end of the deal ... and 
we're grateful for that,” Smith says. 
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Astellar career 

The Hubble Space Telescope’s CV would 
fill volumes. Here are just a few mile- 
stones and a glimpse of the future: 


Hubble is launched 


First images are blurry because of a 
warped mirror 


Servicing mission 1 fixes mirror 


First clear images released 


“Pillars of Creation” photo released 


Hubble Deep Field images released 


Servicing mission 2 


Hubble measures expansion of the 
universe using Cepheid variable stars 


Hubble goes offline for about 
2 months after four of its six 
gyroscopes fail 


Servicing mission 3A makes Hubble 
operational again 


First observation of an exoplanet 
atmosphere (HD 209458 b) 


Servicing mission 3B 


Space shuttle Columbia disintegrates 
on reentry; shuttle fleet grounded 
for 2.5 years 


Hubble Ultra Deep Field released 


Servicing mission 4 


Hubble discovers a supernova split 
into four images by a gravitational lens 


Hubble's successor, James Webb 
Space Telescope, to launch 


Hubble stops operating 
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Today, JWST remains on schedule and 
within budget. Despite delays and un- 
expected complications, the project still has a 
10- to 11-month schedule cushion in reserve. 
“That’s more contingency than was planned 
for at this stage,’ Smith says. More than 99% 
of the spacecraft’s hardware is either built or 
being made, but next comes the tricky task of 
putting it all together. This year, Smith says, 
the scope’s mirror support structure, or back- 
plane, will be delivered and the 18 hexagonal 
mirror sections—each 1.3 meters across—put 
in place. A row of three mirrors on each side 
of the backplane has to fold back like leaves 
of a folding table so that the mirror will fit 
into a rocket fairing, but the backplane 
must still hold the mirrors steady to within 
1/10,000 the width of a human hair even 
while it is cooled to the -240°C of deep space. 

The telescope will have to operate for at 
least 5 years in an orbit 1.5 million kilometers 
from Earth, beyond the reach of servicing 
missions like those that prolonged the life of 
Hubble, so engineers are planning a series 
of tests simulating the rigors of deep space. 
The biggest will come in 2017, when the en- 
tire telescope and its instruments will enter 
the cavernous Chamber A at NASA’s Johnson 
Space Center in Houston, which can recreate 
the vacuum of space as well as the cycles of 
extreme heat and cold. This year, a working 


Probing the depths’ . 


model of the telescope will go through the 
same process three times. “Something could 
happen. You can never guarantee it won't,” 
says Paul Hertz, director of NASA’ astro- 
physics division. “Our job is to make sure 
that it doesn’t happen.” 


WHILE JWST HAS BEEN BROUGHT BACK 
from the brink, the project’s delays and in- 
flated cost have hindered other plans for ma- 
jor astronomy missions. “We always knew we 
couldn’t start a new big mission until JWST 
was finished, Hertz says. One casualty is 
the top priority of the 2010 decadal survey: 
the Wide-Field Infrared Survey Telescope 
(WFIRST), an instrument designed to study 
the nature of dark energy, the mysterious 
force that appears to be accelerating the ex- 
pansion of the universe. 

“As soon as we released the [decadal sur- 
vey] report, NASA told us about major cost 
problems with JWST,’ says Roger Blandford 
of Stanford University in Palo Alto, Califor- 
nia, who headed the survey. “With JWST 
needing more spending to complete, there 
was less funding to spend than the [decadal] 
survey had assumed,” Hertz admits. At the 
same time, the financial downturn squeezed 
NASA, causing the astrophysics budget to 
dip from up to $1.5 billion a year during the 
last decade to about $1.3 billion now. “We are 


. 


operating 10% lower than the historical aver- 
age,” Hertz says. 

So WFIRST entered a phase of suspended 
animation. Hertz says the delay has had posi- 
tive side effects. In 2012, the U.S. National 
Reconnaissance Office offered NASA two 
surplus 2.4-meter mirrors built for spy sat- 
ellites. Now astronomers are reconfiguring 
WFIRST’s design to accommodate a mirror 
more than a meter wider than originally 
planned and adding new capabilities, such 
as directly observing exoplanets. The result, 
they concluded earlier this year, will be a 
better telescope for almost no extra cost. 
Preparatory work continues, but “no formal 
decision has been made yet on when to go 
forward,’ Hertz says. 

The casualties also include two other pri- 
orities of the 2010 survey, the International 
X-ray Observatory (IXO) and the Laser Inter- 
ferometer Space Antenna (LISA), a detector 
of gravitational waves. Both were planned 
as collaborations with the European Space 
Agency (ESA), as Hubble itself is. “They 
seemed like dream collaborations. We'd 
learned how to work together,’ Turner says. 
But with NASA backpedaling on its commit- 
ment and ESA having funding troubles of 
its own, the Europeans didn’t select either 
mission for the launch slots they were aim- 
ing for. ESA has since resurrected [XO as 
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Better instruments and longer exposures are enabling astronomers to see ever deepér into space—and back 
in time past the “epoch of reionization,” when the primordial hydrogen fog of the “dark ages” cleared. 
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the Advanced Telescope for High-ENergy 
Astrophysics (ATHENA) and slated it for a 
2028 launch. NASA is discussing joining the 
project as a minor partner. LISA remains in 
limbo: Both agencies are considering going it 
alone with scaled-down versions, but they are 
still talking about joining forces for a launch 
in the 2030s. “LISA will rise from the dead. 
The science needs to be done,” Turner says. 

Astronomers continue to dream. Hertz, for 
example, has put forward a list of possible 
missions for the next decadal survey. They 
include a large telescope covering ultraviolet, 
visible, and near-infrared that would be 
a true successor to Hubble; a far- 
infrared telescope; a telescope for im- 
aging exoplanets; and an x-ray obser- 
vatory to follow ATHENA. This year, he will 
set up a few study teams to develop the ideas. 
“By the time of the decadal [survey], we'll 
have well worked-out concepts,” Hertz says. 

Others have also been laying groundwork. 
The Association of Universities for Research 
in Astronomy (AURA) in Washington, D.C., 
is studying a slightly different successor to 
Hubble: a large UV-visible light instrument 
it calls the High-Definition Space Telescope 
(see sidebar). HDST would both image gal- 
axies in the early universe with exquisite 
detail and aim to obtain spectra from Earth- 
like exoplanets. “Everyone wants to find an 
Earth,’ says Sara Seager of the Massachusetts 
Institute of Technology in Cambridge, co- 
leader of the study. 

Whatever plans take shape, astronomers 
will surely have to fight hard to get their mis- 
sions off the ground. More realistic costing 
and schedules, with plenty of contingencies, 
will be a must. But in the end, what saved 
JWST was the fact that its science goals are 
profound and a large number of scientists 
support it. “A mission has to appeal to a broad 
range of the community, and the American 
public has to buy into it,’ says AURA Presi- 
dent Matt Mountain. Achieving that broad 
constituency can be difficult, as the planners 
of HDST are finding as they try to marry the 
often diverging needs of exoplanet hunters 
and surveyors of the early universe. 

Ultimately, the prospects for future great 
observatories may all hinge on a few nerve- 
racking weeks late in 2018 when ESA will 
launch JWST and transport it to a point 
1.5 million kilometers from Earth. There it 
will unfurl its sunshield, piece together its 
mirror, and cool down its instruments. Al- 
though it’s all been practiced many times, 
it'll be “7 weeks of terror,’ Mountain says. “If 
they screw up, there is no way back, there’s 
no second chance,” he says. If, however, the 
telescope opens its eye on the universe and 
produces images as dazzling as Hubble’s, 
the future of orbital astronomy will also 
look brighter. = 
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Hubble on steroids 


By Govert Schilling 


s the Hubble Space Telescope’s 
decades-long mission winds down, 
astronomers are worried about the 
future of optical and ultraviolet 
astronomy from space. Hubble’s 
formal successor, the James Webb Space 
Telescope, is poised for launch in late 
2018 (see main story, p. 388), but it will 
be an infrared observatory. So what 
about a real “son of Hubble”? 
Astronomers started discussing plans 
for a “Very Large Space Telescope,” 
sporting an 8- to 10-meter mirror and 
possibly assembled in space, in 1989. 
“Those visionary ideas were probably 
ahead of their time,” says astronomer 
Garth Illingworth of the University of 
California’s Lick Observatory on Mount 
Hamilton. In June, a 17-person commit- 
tee commissioned by the Washington, 
D.C.-based Association of Universities 
for Research in Astronomy (AURA) 
will try again. Their report will outline 
a plan for a High-Definition Space 
Telescope (HDST), so large and capable 
that it would make even the largest 
ground-based telescopes obsolete. 
Given the budgetary and politi- 
cal woes of the Webb telescope, the 
proposal could still be ahead of its 
time. But it is designed to attract broad 
support from astronomers. “We want 
to make this a mission that works for 
everybody,’ says committee Co-Chair 
Julianne Dalcanton of the University of 
Washington, Seattle. Earlier proposals 
favored either astrophysics—the study 
of galaxies and stars—or exoplanet 
research, fields that require very differ- 
ent equipment. HDST would serve both 
camps. “We need to get both on the 
same side,” says AURA’ president, Matt 
Mountain. “If they work separately, they 
don’t get anything.” 
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One current concept for the 
High-Definition Space Telescope. 


For astrophysicists, HDST would 
be “transformational,” says commit- 
tee member Jason Tumlinson of the 
Space Telescope Science Institute in 
Baltimore, Maryland. Tumlinson 
says HDST will resolve structures a 
mere 300 light-years wide in every 
galaxy in the universe and will study 
star-forming regions no wider than 
Earth’s orbit. 

When trained on exoplanets, says 
Tumlinson’s colleague Marc Postman, 
HDST could spectroscopically detect 
the atmospheric constituents, includ- 
ing possible biosignatures, of an 
Earth-like planet within 60 light-years 
or so. “With HDST, the path will be laid 
for characterizing Earth 2.0,” he says. 

Postman envisions a 12-meter-class 
segmented, deployable mirror for 
HDST, comparable in design to the 
Webb telescope’s 6.5-meter segmented 
mirror. Wesley Traub of NASA’s Jet 
Propulsion Laboratory in Pasadena, 
California, prefers a cheaper, single- 
piece elliptical 8-by-3.5-meter mirror, 
to overcome potential problems 
with the segmented mirror design. 
Meanwhile, Illingworth would 
like the designers “to go bigger.” A 
25-meter space telescope could be 
built for far less than twice the cost 
of a 12-meter instrument, he says, 
although it would also require the 
development of new launchers. 

No one has formally estimated the 
cost of HDST. “It will be a multibillion- 
dollar project—that is about all one 
can say at this juncture,’ Postman 
says. If astronomers support it in their 
next priority-setting exercise, the 2020 
Astronomy and Astrophysics Decadal 
Survey, Hubble’s “true” successor could 
be on the launch pad in the mid-2030s, 
perhaps as a cooperative project with 
Europe and other countries. “I’m 
young enough to maybe get to use it,” 
Dalcanton says. & 
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Mutual dependence. Formerly free-living individuals can become physically, genomically, and metabolically integrated, such as shown in the cross section of Paracatenula 
flatworms colonized by intracellular Candidatus Riegeria symbionts (green). Host nuclei are shown in blue and storage compounds in red. 


EVOLUTIONARY BIOLOGY 


Evolving new organisms via symbiosis 


When and how do symbiotic partnerships become new, integrated organisms? 


By E. Toby Kiers' and Stuart A. West” 


ymbiotic partnerships are a major 
source of evolutionary innovation. 
They have driven rapid diversifica- 
tion of organisms, allowed hosts to 
harness new forms of energy, and 
radically modified Earth’s nutrient 
cycles. The application of next-generation 
sequencing and advanced microscopic tech- 
niques has revealed not only the ubiquity 
of symbiotic partnerships, but the extent to 
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which partnerships can become physically, 
genomically, and metabolically integrated 
(1). When and why does this integration of 
once free-living organisms happen? 

Many insects harbor endosymbionts— 
bacteria that live within the host’s cells 
(see the figure, panel A). Although separate 
organisms, they function as a metabolic 
unit. Such dependency can drive extreme 
genomic integration of host and symbi- 
ont at many levels. For example, species 
of mealy bugs depend on bacterial endo- 
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symbionts for nutrient provisioning, and 
the endosymbiont can in turn harbor its 
own endosymbiont (2). Patterns of sym- 
biont within symbiont dependencies oc- 
cur across animals, as well as plant hosts, 
which use photosynthesizing plastids as 
a source of energy (see the figure, panel 
B) (3). Endosymbionts can even speciate 
within their hosts, as has been found in 
Cicada insects (4). 

One potential outcome of host-symbi- 
ont integration is a reduction in symbiont 
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Major transitions 


Not major transitions 


Major transitions. (A to C) Symbiotic partnership resulting in major transitions in individuality: (A) endosymbionts 
Candidatus Hodgkinia (red) and Candidatus Sulcia (green) in a cicada host, with insect cell nuclei shown in magenta; 
(B) photosynthetic plastids of fern cells; (C) Paracatenula flatworm with intracellular C. Riegeria symbionts (red) and 
host nuclei (blue). (D to F) Symbiotic partnership where major transitions in individuality have not taken place: (D) 
giant marine tubeworm, Riftia, which is obligately dependent on nutritional symbionts gained during its larval stage; 
(E) nodules of legume Lathyrus japonicus housing N.-fixing rhizobial symbionts; (F) bioluminescent bobtail squid. 


genome size. A species of leaf hopper, Mac- 
rosteles quadrilineatus, harbors the endo- 
symbiont Nasuia deltocephalinicola, the 
smallest bacterial genome sequenced to 
date (5). Similarities in genome size, coding 
capacity, and the ability to import proteins 
produced by the host has driven a debate 
as to whether such endosymbionts are func- 
tionally equivalent to host organelles (6). 
Organelles evolved when free-living pro- 
teobacterial and cyanobacterial ancestors 
became incorporated into host cells, ulti- 
mately forming the mitochondria and plas- 
tids that power eukaryotes. Although the 
debate has largely focused on the genetic 
and cellular differences between endosym- 
bionts and organelles, it raises a larger 
evolutionary question: When do we expect 
symbiotic partnerships to evolve into new, 
integrated organisms? 

Maynard Smith and Szathmary’s major 
transitions approach provides a framework 
for addressing this question (7). It focuses 
on cases where groups of individuals that 
could previously replicate independently 
cooperate to form a new, more complex 
organism. Examples include the formation 
of chromosomes from the simplest replica- 
tors, of the eukaryotic cell from archaeal 
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and eubacterial cells, and of multicellular 
organisms through cellular cooperation. By 
asking what conditions favor these transi- 
tions, the framework emphasizes compari- 
sons across different levels of biological 
organization (8). 

The elegance of the major transitions 
framework is its simplicity. It argues that 
the same problem—how to overcome the 
selfish interests of individuals to form mu- 
tually dependent cooperative groups—has 
arisen and been solved at several crucial 
moments in history across all orders of life. 
At the same time, it recognizes that transi- 
tions in individuality are rare and require 
strict conditions: Partner interests need 
to be aligned and the benefits of more in- 
tegrated cooperation must lead to mutual 
dependence. 

How can group conflict be eliminated 
and loss of autonomy become favorable? 
Symbiotic partnerships involve two levels 
of potential conflict: between symbiont 
and hosts and among symbionts sharing 
a host. Hosts have evolved sophisticated 
mechanisms to manage their microbes, but 
the resulting symbioses do not necessarily 
eliminate group conflict (9). Furthermore, 
although repression of competition is nec- 
essary, it is not sufficient to drive a major 
evolutionary transition, which requires 
mutual dependence. This can happen when 
symbionts access new forms of energy for 
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their hosts, as with mitochondria, and the 
fitness of partners becomes so intertwined 
that obligate dependency is favored (0). 

Some symbioses are good candidates for 
major transitions. For example, Paracaten- 
ula flatworms are colonized by symbionts 
that harness chemical sources of energy 
(see the lead photo, page 392). The hosts 
have evolved such a level of symbiont de- 
pendency that they have lost mouths and 
digestive tracts (see the figure, panel C). In 
turn, the symbionts have reduced genomes 
and are passed directly from parent to off- 
spring (vertical transmission) (77). As with 
some insect endosymbioses (see the figure, 
panel A), selection has created organism- 
level adaptations, such as complementary 
genome modifications, that suggest suc- 
cessful and ongoing major transitions. 

To understand when and why new or- 
ganisms evolve via symbiosis, it is useful to 
also look at cases where major transitions 
have not been made, for example when 
there are asymmetries in dependence. The 
giant marine tubeworm Riftia lacks a di- 
gestive system as an adult and depends on 
a nutritional symbiont gained during the 
larval stage (12) (see the figure, panel D). 
Whereas the host has evolved a highly spe- 
cialized organ to house this symbiont, the 
bacterial partner retains a free-living stage, 
is transmitted horizontally, and has not ex- 
perienced major genome reduction. This 
suggests that although the partnership pro- 
vides benefits, there is sufficient conflict or 
favorable options outside the host to select 
against symbiont integration. 

Other examples where major transi- 
tions have not been made include legumes, 
which are provided with nitrogen by their 
rhizobia symbionts (see the figure, panel E), 
and squid, which obtain light for camou- 
flage from bioluminescent bacteria (see the 
figure, panel F). Both hosts acquire their 
bacterial symbionts directly from the envi- 
ronment, relying on complex signaling and 
coordinated molecular pathways to initiate 
symbiotic development. Yet despite strong 
coevolutionary histories, partners retain 
autonomy as individuals (13, 14). The ma- 
jor transitions framework suggests that this 
is because either strict mutual dependence 
is not beneficial or there is sufficient con- 
flict between partners or among symbionts. 
Thus, even when coevolution results in in- 
tricate cross-talk and specialized structures 
to house symbionts, this coordination does 
not necessarily imply a major transition to 
a new level of organism. 

Major transitions research suggests that 
the mode of transmission is key to which 
symbioses form new organisms. When the 
bacterial partner is acquired directly from 
the environment, as in the giant tubeworm, 
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squid, and legume symbioses, the host is 
working simultaneously with multiple gen- 
otypes. This allows hosts to exploit a wider 
range of environmental conditions, but also 
increases the potential for conflict among 
competing symbionts (75). To help manage 
conflict, specific structures in hosts (such 
as crypts in squid) effectively separate dif- 
ferent symbiont genotypes into individual 
chambers (73). In contrast, vertical trans- 
mission of symbionts from parent to off- 
spring, as in Paracatenula flatworms, can 
lead to a high relatedness between the sym- 
bionts within a host (75), linking the fitness 
of the symbiont to host performance, reduc- 
ing conflict, and selecting for integration. 

Ecological context can also help to pre- 
dict when and why transitions have been 
made (8). When benefits of partnerships 
vary with environmental context, mutual 
dependency is less likely to evolve. For ex- 
ample, in the legume-rhizobia symbiosis, 
the reliance on the bacterial partner de- 
creases in high-nitrogen environments (74). 
Here, strict dependency can be costly, mak- 
ing a major transition to a single, higher- 
level organism precarious and unlikely if 
benefits are not absolute. 

The advantage of the major transitions 
framework is that it emphasizes differ- 
ent questions from the mechanistic ones 
currently being asked. Interplay between 
evolutionary theory and genomic research 
will allow us to understand the evolution of 
organismal complexity within a single, uni- 
fied framework. & 
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GEOCHEMISTRY 


Biogeochemical tales told 
by isotope clumps 


Molecules with two or more heavy isotopes provide 
insights into diverse biological and geological phenomena 


By Benjamin H. Passey 


ow do you take a dinosaur’s tem- 

perature, reconstruct the elevation 

histories of Earth’s great mountain 

ranges, probe the workings of pho- 

tosynthesis, and confirm biological 

origins of a greenhouse gas? Increas- 
ingly, the answer lies in clumps. Clumped 
isotope geochemistry (J) is the latest 
branch of stable isotope geochemistry, the 
field that illuminated the Pleistocene gla- 
cial-interglacial cycles (2), the oxygenation 
of Earth’s atmosphere some 2.5 billion 
years ago (3), and the enigmatic presence 
of grass in hominid diets (4). In this issue, 
Yeung et al. (page 431) (5) and Wang et al. 
(page 428) (6) describe clumped isotope ef- 
fects that, among other things, can serve as 
tracers of biological versus abiological ori- 
gins of gases. 

Traditional isotope geochemistry is con- 
cerned with ratios of heavy to light isotopes. 
In contrast, clumped isotope geochemistry 
looks at occurrences of two or more heavy 
isotopes in the same molecule, such as *O,, 
*®C8O0, and “CH,D. In its decade of exis- 
tence, clumped isotope geochemistry has 
focused on the thermodynamic aspects— 
particularly the temperatures of molecular 
and mineral synthesis—recorded in isotopic 
clumps. Hence, dinosaur body temperatures 
have been estimated from clumped carbon- 
ate ions (°C'°O"%03-) in fossil teeth (7), and 
altitudes of ancient mountain ranges from 
C8003, in fossil soil carbonates (8). 

A simple game helps to understand iso- 
topic clumping. Here, we are not interested 
in the total amount of isotopic clumping, 
which simply scales with the isotopic com- 
position of the substance. Rather, we seek 
the deviation from amounts predicted by 
chance alone. The game is rolling the dice, 
and the goal is snake eyes—one pip facing 
up on both dice. The probability of snake 
eyes for regular six-sided dice is 1/6 x 1/6 = 
1/36, or 2.77%. The probability changes with 
the number of sides on each dice, becoming 
1/4 for two-sided dice and 1/10* for 100-sided 
dice. Changing the number of sides is like 
changing the isotopic composition (9). To 
win the game, we have to beat chance, and 
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changing the number of sides will not help. 

So far, clumped isotope geochemistry has 
been concerned with gaming the system 
on the positive side: coming up with more 
snake-eyes (or heavy isotope clumps) than 
predicted by chance alone. This situation is 
actually preferred by thermodynamics: un- 
der conditions of chemical equilibrium, lev- 
els of clumping will be slightly higher than 
levels predicted by chance alone. The isoto- 
pic “dice” are weighted by thermodynamics 
(see the figure). This enrichment in clumps, 
signified by A and on the order of a few parts 
per thousand, increases as molecular synthe- 
sis temperatures decrease (10, 11). 

Yeung et al. and Wang et al. now show that 
the system can also be gamed in the opposite 
direction: coming up with fewer snake-eyes 


“How can molecules 
seemingly evade both chance 
and thermodynamics? The 
answer appears to relate to 
the biological assembly of 
molecules from nonidentical 
substrate binding sites under 
irreversible conditions.” 


(or fewer heavy isotope clumps) than pre- 
dicted by thermodynamics and sometimes 
fewer than predicted by chance statistics. 
How can molecules seemingly evade both 
chance and thermodynamics? The answer 
appears to relate to the biological assembly 
of molecules from nonidentical substrate 
binding sites under irreversible conditions. 
Photosynthesis generates O, by combin- 
ing oxygen atoms from two water mol- 
ecules. The oxygen-evolving complex of 
Photosystem II has two water-binding sites 
that are thought to be nonidentical. It is 
therefore plausible that each site has a dif- 
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Isotope clumping and anticlumping in oxygen and methane 
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More than a roll of the dice. The appearance of more than one rare, heavy isotope in a molecule (a heavy isotope “clump”) is not random. Instead, the isotopic dice are weighted. 
Clumps are strongly favored under conditions of chemical equilibrium, whereas kinetic processes related to biosynthesis may have lower preference for clumps and in some cases 
may select against clumps, leading to “anticlumped” products. Natural methane gas shows both behaviors (6), whereas the isotope content of photosynthetic oxygen is random or 


anticlumped (i.e., below chance levels) (5). 


ferent affinity for %O relative to common 
16O (or “O relative to O). Yeung et al. pre- 
dict that if this is true, the amounts of *O, 
and “OO in photosynthetically produced 
O, will always be lower than levels pre- 
dicted by chance statistics. This astonish- 
ing outcome is true regardless of whether 
both sites prefer '%O, both sites prefer ‘O, 
or one site prefers °O and one site prefers 
189, The authors investigate this prediction 
by studying the evolution of clumped O, in 
a sealed terrarium filled with water hya- 
cinths. The experiment did not go exactly 
as envisioned: The plants eventually died 
and were replaced by an “algae-dominated 
ecosystem.” However, there was a clear 
lowering of *O, and "OO toward the “sub- 
chance” levels (negative A) predicted by the 
model for photosynthetic O,,. 

Wang et al. study clumped methane 
(°CH,D) from sources as diverse as cow ru- 
mens, deep sea hydrothermal vents, and the 
Marcellus Shale. In agreement with a study 
published last year (72), the geological gases 
had positive A values reflective of isotopic 
equilibrium at the temperatures of their 
sources. In contrast, methane from cow ru- 
mens, swamps, lakes, and lab cultures had 
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low levels of clumping which, if interpreted 
in the framework of thermodynamic equilib- 
rium, would suggest exceedingly high tem- 
peratures of methane synthesis—typically 
greater than 200°C. Such temperatures are 
nonsense, and clearly the thermodynamic 
equilibrium model is not applicable to this 
system. Prompted by these findings, Wang et 
al. developed a model of microbial methano- 
genesis that predicts discrimination against 
the heavy clumped species (8CH,D) rela- 
tive to molecules with single heavy isotopes 
(°CH,D, *CH,). This discrimination increases 
as the degree of irreversibility increases, 
which is proportional to the availability of 
the limiting substrate, here H,. Thus, the 
model predicts lower A values under higher 
H, and hence higher rates of synthesis. 

The clumped isotope anomalies will help 
place much-needed constraints on biogeo- 
chemical sources, sinks, and budgets of O, 
and CH,, perhaps even over glacial-intergla- 
cial cycles from gases trapped in ice cores. 
Similar effects are possible in other biogenic 
gases like nitrous oxide and ethane. Clump- 
ing in O, from human-made oxygen-evolving 
systems (13) could help to elucidate reaction 
mechanisms. And far away on Mars, the Cu- 
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riosity rover has detected transient whiffs of 
methane (/4). If the laser spectrometer of 
Wang et al. could be adapted for missions to 
Mars and beyond, it would be a major boon 
to the search for extraterrestrial life. m 
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SELF-ASSEMBLED MATERIALS 


Supramolecular lattices from 
tetrahedral nanobuilding blocks 


The location of hydrophilic and hydrophobic regions on tetrahedral nanoparticles 
determines how they pack together into a variety of structures 


By Shu Yang 


he self-assembly and packing of nano- 

scale particles that have nonspheri- 

cal shapes is only beginning to be 

explored. Recent computer simula- 

tions have suggested that hard, con- 

vex polyhedra can pack more densely 
than spheres, leading to complex structures 
of liquid crystals, plastic crystals, crystals, 
and disordered phases (1). In particular, 
aperiodic quasicrystalline and crystalline 
phases occur in the packing of tetrahedra, 
the simplest polyhedron (2). On page 424 
of this issue, Huang et al. report the precise 
synthesis of rigid giant tetrahedra consist- 
ing of hydrophobic and hydrophilic poly- 
hedral oligomeric silsesquioxane (POSS) 
cages (3). They assemble a diverse range of 
highly ordered supramolecular lattices by 
exploiting the location of the hydrophilic 
POSS cage substituent and the number of 
substitutions, together with the interplay of 
crystallization and the collective hydrogen- 
bonding interactions. 

The A15 phase was discovered in metal al- 
loys with A,B composition (4). As seen on the 
right side of the figure, it is a cubic lattice 
consisting of six A units (pale red spheres) 
in Kasper polyhedra with 14-coordination 
and two B units (dark red spheres) in icosa- 
hedra with 12-coordination. Its space group 
is Pm3n and it exhibits a two-dimensional 
4*-square tiling pattern. Relative to other 
superlattices, the A15 phase is more loosely 
packed and can better accommodate distor- 
tion away from ideal sphere packing. 

Closely packed face-centered cubic 
(fec) and hexagonal close-packed (hcp) 
structures are common from packing of 
hard spheres, but soft spheres often form 
non-close-packed lattices, including body- 
centered cubic (bcc), body-centered ortho- 
rhombic (bco), and the diamond lattices. 
Recently, the formation of the A15 phase for 
soft spheres has been suggested by theory 
(5, 6) and demonstrated experimentally for 
packing of “deformable” soft spheres as- 
sembled from lyotropic lipids, conical fan- 
shaped dendrimers (7), and asymmetric 
block copolymers (8) (see the figure, top 
and middle left, respectively). 
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Typically, these building blocks have a 
hard core that favors lattices with maxi- 
mum packing fraction and a soft corona 
that can be deformed to maximize entropy 
and minimize interfacial contact. Nonethe- 
less, these soft spheres do not constitute a 
single-particle system. Often a mixture of 
bec lattices, Frank-Kasper o phases, and 
Al5 phases is observed, because block co- 
polymer and dendrimer particles are not 
identical but are polydisperse—they have 
a distribution of chain lengths. The large 
number of chain conformations and the 
soft nature of polymer chains also impede 


It will be interesting 

to extend the chemical 
strategy presented by 
Huang et al. to construct 
other types of giant 
polyhedra and investigate 
their self-assembly. 


precise control of the molecular structures 
and their assemblies. 

Molecular nanoparticles (MNPs), includ- 
ing fullerenes such as C,,, POSS derivatives, 
and polyoxometalates, represent a new 
class of nanobuilding blocks. Bridging the 
length scale between polymers and small 
molecules, MNPs can be synthesized with 
explicit chemical composition and precisely 
controlled size, shape, symmetry, and func- 
tion. They are rigid, incompressible, and 
impenetrable. MNPs have been regarded as 
“nanoatoms” or “giant molecules.” 

Among them, POSS derivatives (9) have 
enjoyed rapid growth in research and appli- 
cations because of their well-defined struc- 
tures, high interaction parameters, and 
tunable chemistry of the periphery groups. 
Hydrolysis of RSiX, (where X is a halide and 
R can be alkyl, aryl, alkoxy, epoxy, hydroxy], 
carboxylic acid, amine, or vinyl groups), 
followed by condensation, leads to the 
spontaneous formation of cage-like POSS, 
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denoted T,R, (where each Si is connected 
with three oxygens to form a T unit). The 
resulting compounds have the general for- 
mula (RSiO, ja)" 

Creating giant tetrahedra with precise 
partitioning of hydrophobic and hydro- 
philic POSS cages requires site-selective 
functionalization and regioselective multi- 
functionalization of the apex by the POSS 
cages. First, the hydrophobic POSS cages 
with seven isobutyl groups (BPOSS) and 
one alkyne group are coupled to tetrakis(4- 
azidophenyl)methane via the azide-alkyne 
[3+2] cycloaddition reaction. Because these 
“click” reactions offer near-quantitative 
yield, giant tetrahedra with one, two, or 
three unreacted azide groups are obtained 
by controlling the stoichiometric feed ratio 
of the azide versus alkyne groups. In the 
second step, hydrophilic POSS cages with 
either hydroxyl or carboxylic acid groups 
are coupled to the giant tetrahedron via a 
thiol-ene reaction. Although vinyl groups 
are often incompatible with hydroxyl or 
carboxylic groups, they possess orthogonal 
reactivities in click chemistry, where the re- 
action is site-selective. 

Thus, it is possible to sequentially intro- 
duce hydrophilic POSS cages one by one 
on the giant tetrahedra, much like putting 
together Lego blocks. Such unprecedented 
control of shape and topology is not pos- 
sible in polymer systems. Incorporation of 
different reactivities leads to competing in- 
teractions that drive self-assembly—that is, 
collective hydrogen-bonding interactions 
between the hydrophilic POSS cages and 
the crystallization of BPOSS cages. Mean- 
while, replacing the BPOSS cage with a 
hydrophilic POSS cage on the tetrahedron 
lowers the molecular symmetry. 

When cast from solution on mica, crys- 
tallization of BPOSS cages dominates. To 
minimize interfacial energy, frustrated 
supramolecular lamellae are formed with 
two layers of BPOSS and one interdigitated 
layer of the hydrophilic POSS cages. The 
preformation of lamellae is critical to the 
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Different parts for self-assembly of A15 supramolecular lattices. On the right side of the figure, the Al5 lattice, which assembles for metal alloys from two different types of 
atoms, is shown as a three-dimensional rendering at the top and as a 4*-square tiling pattern at the bottom. Different routes to this same type of lattice are shown on the left side for 
dendrimers at the top, asymmetric block copolymers in the middle, and giant tetrahedral POSS cages, as elaborated by Huang et al., at the bottom. 


latter superlattice formation. Upon melting, 
hydrophilic POSS cages form spherical ag- 
gregates while BPOSS layers scroll to form 
the shell. The core-shell spheres spontane- 
ously pack into the A15 supramolecular lat- 
tice as a result of balancing the molecular 
symmetry and molecular interactions (see 
the figure, lower left). 

Design of giant nanoatoms from precisely 
controlled organic and hybrid materials also 
has caught the eye of polymer scientists. 
For example, Ober and co-workers dem- 
onstrated sub-50 nm patterns using glass- 
forming, low-molecular-weight organic 
compounds as photoresists (J0). Relative 
to traditional polymeric resists, molecular 
glass resists offer smaller feature size and 
line-edge roughness as well as higher dis- 
solution contrast, which is critical to driving 
miniaturization of electronic devices on the 
sub-100 nm scale. 

The precise synthesis of Lego-like POSS 
cages opens up an entirely new paradigm 
of self-assembly in which lattices can be as- 
sembled one-by-one and on demand. The 
rich chemistry that can be introduced on 
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POSS cages, the versatility and regioselec- 
tivity enabled by click chemistry, and the 
possible structural variations of POSS de- 
rivatives will be of great interest to mate- 
rials scientists. Besides T,-type POSS cages, 
there are less stable T,, T,,, and T,, POSS 
cages. It will be interesting to extend the 
chemical strategy presented by Huang et 
al. to construct other types of giant poly- 
hedra and investigate their self-assembly. 
Meanwhile, the coupling reactions and the 
molecular interactions can be fine-tuned to 
assemble other superlattices. Furthermore, 
small-molecule fluorophores, metal com- 
plexes, conjugated molecules, and peptides 
can be introduced into the nanocavity of 
the POSS cage or its periphery for imaging, 
sensing, catalysis, charge-carrier transport, 
and biological applications (9). Also, the T, 
core may not act as a traditional silica-like 
insulator. 

Finally, it will be interesting to disperse 
these shape-persistent tetrahedra and their 
superlattices into a polymer matrix or a liq- 
uid crystal medium, or to place them on a 
patterned surface to create truly hierarchi- 
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cal assemblies. They can also be tethered 
to a polymer chain to construct a block 
copolymer analog, so as to take advantage 
of their rich phase separation behaviors, or 
to form complexation with other polyhedra 
to create Janus superlattices. The ability to 
build nano-Legos with precise geometrical 
factors and tunable molecular interactions 
offers a powerful yet versatile tool to create 
ever more complex functional materials. & 
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Calcilytics for asthma relief 


A promiscuous calcium receptor holds promise 
as a therapeutic target for asthma 


By Raymond B. Penn 


or decades, patients suffering from 
allergic asthma—the symptoms of 
which range from wheezing and 
shortness of breath to extreme dif- 
ficulty in breathing—generally have 
been treated with agents that reduce 
airway inflammation, such as corticoste- 
roids, or reverse airway constriction, such 
as B,-adrenergic receptor agonists. Un- 
fortunately, up to 55% of asthmatics have 
suboptimal control with these drugs (J). 
Moreover, safety concerns regarding the 
use of B, agonists have resulted in more 
stringent guidelines for their clinical use 
(2). Thus, there is pressing need to de- 
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velop new and safe therapies. A study by 
Yarova et al. (3) points to a protein that 
controls the constriction and proliferation 
of smooth muscle cells lining the bronchial 
airway. Moreover, there is already a drug 
that inhibits this protein. 

The calcium-sensing receptor (CaSR) is 
best known for its role in regulating cal- 
cium homeostasis. Expressed in the para- 
thyroid gland, it surveys the concentration 
of circulating calcium. Calcium binds to 
and activates the CaSR, triggering intra- 
cellular signals that suppress the release 
of parathyroid hormone. This reduces 
calcium release from stores such as bone. 
However, the CaSR is promiscuous in its 
activation by other molecules, including 
polyvalent cations, amino acids, and virus 
elements. Indeed, Yarova et al. show that 
the CaSR uses this robust sensing abil- 
ity to control airway hyperresponsiveness 


Airway wall 
Mucus 


and inflammation in asthma. Importantly, 
inhibitory small molecules previously de- 
veloped to antagonize the CaSR and treat 
hypocalcemia mitigate key features of al- 
lergic asthma. 

Yarova et al. examined bronchial airway 
tissues and cells from a mouse model of 
asthma and biopsies from asthma patients, 
and demonstrate that calcium, as well as 
various agents (including eosinophilic cat- 
ionic protein and L-arginine-derived poly- 
amines) whose concentrations are elevated 
in the bronchial airway of asthma patients 
during allergic inflammation, can stimulate 
CaSR. In bronchial airway tissue excised 
from the mouse asthma model, as well as 
in the intact animal, this stimulation re- 
sulted in airway smooth muscle contrac- 
tion, and enhanced the contractile effect of 
bronchoconstrictors such as acetylcholine 
or histamine. The authors confirmed the 
CaSR-dependent nature of these effects 
in both human bronchial airway smooth 
muscle cells with reduced expression of 
CaSR and in bronchial airway tissue and 
cells isolated from genetically modified 
mice lacking CaSR. In addition, when mice 
with allergic lung inflammation inhaled 
CaSR antagonists (calcilytics), the asthma 
phenotype was reversed. Calcilytics blocked 
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Asthma relief. Numerous agents activate the CaSR expressed on airway smooth muscle, as well as on certain inflammatory cells and airway epithelia (not shown). CaSR activation in airway 
smooth muscle promotes contractile hyperresponsiveness and possibly airway remodeling. CaSR activity also facilitates inflammatory cell infiltration into the lung (not shown). Calcilytics 
block the CaSR, and may represent an inhalable agent to treat asthma. IP,, inositol 1,4,5-trisphosphate; ERK, extracellular signal-regulated kinase; PI3K, phosphatidylinositol 3-kinase. 
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the CaSR, prevented airway hyperrespon- 
siveness and, surprisingly, inflammation. 
The latter effect could be due to expression 
of the CaSR not only in resident lung cells 
(airway smooth muscle and epithelia) but 
also in invading inflammatory cells (eosino- 
phils, macrophages). 

Interestingly, Yarova et al. show that in- 
flammation increases CaSR expression in 
human and mouse. That is consistent with 
the findings that show a prominent role of 
the CaSR in affecting airway contractility 
under asthmatic conditions. 

Thus, the CaSR emerges as an exciting 
potential asthma therapeutic target (see 
the figure). Recently, both bitter tastant 
receptors and chloride channels have been 
proposed as drug targets in asthma (4). Ac- 
tivated bitter tastant receptors relax airway 
smooth muscle and dilate the airway in hu- 
man and mouse, but the mechanism is not 
yet clear (5). Chloride channel blockers also 
act on airway smooth muscle and limit the 
ability of agonists such as acetylcholine to 
stimulate contraction (6). However, calci- 
lytics have certain advantages that favor 
success as a therapy for asthma. They are 
small molecules that are readily deliver- 
able by inhalation, thus limiting potential 
problematic systemic effects. Their efficacy 
is favored by the ability to target multiple 
cell types and mechanisms that contribute 
to the asthma phenotype. Perhaps most 
importantly, the safety of three different 
oral calcilytics as osteoporosis drugs is sug- 
gested in phase I clinical studies (7, 8), and 
the calcilytic NPSP795 is currently in phase 
II clinical trials for the treatment of auto- 
somal dominant hypocalcemia (9). Results 
from these studies may help limit the regu- 
latory hurdles that an inhaled calcilytic 
as an asthma therapy will ultimately face. 
The ability to assess the presence of CaSR 
activators in the lung could enable identi- 
fication of those asthmatics most likely to 
respond to calcilytics, in line with the goal 
of personalized asthma treatment (J0). & 
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PROTEIN SYNTHESIS 


The delicate dance of 
translation and folding 


RNA translation and protein folding affect 
each other during protein synthesis 


By Joseph D. Puglisi 


he past decade has seen rapid growth 

in our knowledge of how proteins are 

synthesized in cells. This includes 

the basic step of how transfer RNAs 

(tRNAs) decode messenger RNAs 

(mRNAs) with high fidelity and speed, 
how the ribosome moves along mRNA from 
codon to codon during translation, and how 
synthesis of the corresponding polypeptide 
chain is initiated and terminated at spe- 
cific points on the mRNA (J). Structures 
of the ribosomal particles—megadalton 
RNA-protein assemblies—have provided 
detailed molecular views of the active sites 
for mRNA decoding and peptide bond for- 
mation, and suggested pathways for move- 
ment of ligands, factors, and the ribosomal 
subunits themselves. Obscured in this rich 
cache of knowledge is the fate of the protein 
product. How does it fold during mRNA 
translation and how might protein folding 
affect translation itself? These questions 
are addressed by elegant biophysical and 
biochemical approaches reported by Gold- 
man e¢ al. (2) and Kim et al. (3) on pages 457 
and 444, respectively, of this issue, adding 
to a growing appreciation of cotranslational 
protein folding (4-6). 

During translation, a nascent protein 
is synthesized from its amino to carboxyl 
terminus. The growing polypeptide passes 
through a 100 A conduit in the large ribo- 
somal subunit (the ribosomal exit tunnel) to 
emerge into solution. The exit tunnel is nar- 
row, and can only accommodate a-helical 
conformations of the polypeptide chain. 
Thus, the nascent protein can only fold 
into tertiary structure once the appropri- 
ate regions of the polypeptide chain are 
exposed to solution. The folding of small 
protein domains is spontaneous and rapid 
in vitro, with folding time scales of 10° to 
10° s. For larger proteins, those with com- 
plex topologies, or multidomain proteins, 
folding can take longer and require the 
assistance of protein-folding chaperones. 
Because translation occurs at rates of 1 
to 20 amino acids per second (depending 
on organism and conditions), folding and 
translation often occur simultaneously on a 
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translating ribosome (cotranslational fold- 
ing). Many chaperones and processing fac- 
tors bind to nascent chains as they emerge 
from the ribosomal exit tunnel (7). It is thus 
paramount to understand the relationship 
among protein synthesis rates and protein- 
folding processes. 

This interplay is underscored by the ex- 
istence of sequences within a nascent poly- 
peptide that cause translating ribosomes to 
pause or stall (8). Stalling sequences were 
discovered during translation of SecM from 
Escherichia coli. Ribosomes translating pro- 


"ow might protein folding 
affect translation itself?" 


teins with the SecM sequence, an 18-amino 
acid stretch, pause and stall when this se- 
quence is located within the ribosomal exit 
tunnel near the peptidyl tRNA and narrow 
constrictions in the tunnel (9). Interac- 
tions between the peptide and ribosomal 
elements slow and block subsequent steps 
in the elongation process (JO) at a precise 
point, during which a paused translation 
complex docks to a secretion tunnel that 
uses energy from the hydrolysis of ad- 
enosine 5’-triphosphate to pull the protein 
through the secretion tunnel. This suggests 
that force applied to the ribosomal nascent 
chain can relieve stalling of translation. Sev- 
eral in vivo measurements have supported 
this hypothesis (J/, 72). 

Applying elegant single-molecule ap- 
proaches, Goldman et al. demonstrate ex- 
plicitly that force on a polypeptide chain 
can relieve translational stalling. They used 
optical trapping technology, whereby mol- 
ecules attached to larger beads are trapped 
and manipulated by a strong laser beam, 
allowing measurements and changes in the 
force applied to a biological system. Gold- 
man e¢ al. prepared stalled, translating ri- 
bosomes containing nascent chains labeled 
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Folding generates force to 
pull out of paused position 


tunnel portal can generate a directional force that allows translation to resume. 


with biotin. A ribosome was linked, through 
a tether, to a static micropipette, and the 
biotinylated nascent chain was attached 
to a polystyrene bead. The stalled nascent 
chain was designed to have an amino-termi- 
nal calmodulin domain, which has a well- 
characterized folding behavior. At a fixed 
small applied force, the calmodulin domain 
fluctuates between folded and unfolded. 
This oscillation yields a clear signal as the 
polystyrene bead moves away and toward 
the center of the trap following the unfold- 
ing and refolding of the protein. Restarting 
of translation releases the tether as protein 
synthesis resumes, allowing an indirect 
monitoring of translation and force on the 
nascent chain. 

Goldman et al. show that increasing the 
force on the nascent chain overcomes the 
stalling induced by the SecM sequence. The 
authors then used a protein whose folding- 
unfolding transition as a function of force 
has been well characterized (Top7), to see 
whether folding of a protein near the ribo- 
some could generate a directional pulling 
force similar to that of an optical trap. Using 
an in vivo assay for synthesis of fluorescent 
protein that can only occur if a SecM stall 
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is relieved, Goldman et al. show that hav- 
ing a folding domain near the ribosomal 
exit tunnel could exert force on the stalled 
ribosomal nascent chain such that transla- 
tion can resume (see the figure). To relieve a 
SecM-induced stall in translation, at least 10 
pN of force—equivalent to that generated by 
folding of a small domain—must be applied 
to a nascent chain. 

The results of Goldman et al. show how 
protein folding can modulate translational 
dynamics, but can translation rates and oc- 
cupancy of nascent chains within the ribo- 
some also control protein-folding pathways? 
A growing body of experiments has probed 
this question both in vivo and in vitro. Kim 
et al. use ensemble fluorescence resonance 
energy transfer (FRET) to probe distances 
between an amino-terminal fluorescent pro- 
tein and dyes attached chemically to distinct 
residues in carboxyl-terminal domains of 
the cystic fibrosis transmembrane conduc- 
tance regulator, whose misfolding is linked 
to the disease. Kim et al. measured ensem- 
ble FRET for nascent chains that were ei- 
ther attached to ribosomes (not translating) 
or released free in solution, and found that 
late folding events that involved interaction 
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Elongation 
continues 


between the carboxy-terminal 
part of the polypeptide chain 
with an amino-terminal domain 
of the chain were slowed. This 
delay was tuned by the occu- 
pancy of that strand within the 
tunnel, and by stabilized folding 
of domains beyond the ribosome, 
through interaction with the ri- 
bosome. These results, and those 
of Goldman et al., show how the 
ribosome itself can guide folding 
pathways. Kim et al. also show 
how codon usage could tune 
translation rates and occupancy 
of nascent chains within the exit 
tunnel, consistent with prior pre- 
dictions from multiple studies. 

The results of Goldman et 
al. and Kim et al. demonstrate 
clearly the coupling of protein 
folding and translation, but 
much work remains. Real-time 
dynamic data (13) would reveal 
directly how folding and transla- 
tional steps are linked. Structural 
studies are needed, with ad- 
vances in cryo-electron micros- 
copy in particular (14), to show 
how a ribosome interacts with 
a nascent chain to stall transla- 
tion or guide folding pathways. 
An additional layer of biological 
importance and complexity is 
the cluster of factors that bind to 
a nascent polypeptide as it exits 
a ribosome. These factors process or modify 
the chain, chaperone its folding, or guide 
the nascent protein to a cellular target (7). 
Defining their roles dynamically, mechanis- 
tically, and structurally will enrich our view 
of translation. & 
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SUSTAINABILITY 


Ecosystem services lost to oil 
and gas in North America 


Net primary production reduced in crop and rangelands 


By Brady W. Allred,!* W. Kolby Smith,’ 
Dirac Twidwell,? Julia H. Haggerty, * 
Steven W. Running,’ David E. Naugle,’ 
Samuel D. Fuhlendorf® 


dvanced technologies in oil and 
gas extraction coupled with energy 
demand have encouraged an aver- 
age of 50,000 new wells per year 
throughout central North America 
since 2000. Although similar to past 
trends (see the graph, this page), the space 
and infrastructure required for horizontal 
drilling and high-volume hydraulic fractur- 
ing are transforming millions of hectares of 
the Great Plains into industri- 
alized landscapes, with drilling 
projected to continue (J, 2). Al- 
though this development brings economic 
benefits (3) and expectations of energy se- 
curity, policy and regulation give little at- 
tention to trade-offs in the form of lost or 
degraded ecosystem services (4). It is the 
scale of this transformation that is impor- 
tant, as accumulating land degradation 
can result in continental impacts that are 
undetectable when focusing on any single 
region (5). With the impact of this transfor- 
mation on natural systems and ecosystem 
services yet to be quantified at broad ex- 
tents, decisions are being made with few 
data at hand (see the graph, this page). 

We provide a first empirical analysis 
to advance beyond common rhetoric and 
speculation of oil and gas development (6), 
combining high-resolution satellite data 
of vegetation dynamics with industry data 
and publicly available data of historical 
and present-day oil and gas well locations 
for central North America. In addition to 
this broad-scale assessment of satellite- 
derived net primary production (NPP), a 
fundamental measure of a region’s ability 
to provide ecosystem services (7), we also 
evaluate patterns of land-use change and 
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water use. Before this work, little has been 
done in examining these types of data and 
their relations with ecosystem services at 
broad scales. 


ECOSYSTEM SERVICE TRADE-OFFS. NPP 
is the amount of carbon fixed by plants and 
accumulated as biomass. It is a fundamen- 
tal and supporting ecosystem service that is 
the basis for all life on Earth (8). As such, 
the dynamics of NPP affect regional ability 
to provide a host of other essential ecosys- 
tem services (e.g., food production, biodi- 
versity, wildlife habitat), which makes it a 
robust metric for broad evaluation of eco- 
system services. Oil and gas activity reduces 
NPP through direct removal of vegetation 
to construct oil pads, roads, and so on. 
These satellite-derived measurements of 


of carbon per year, we convert to equiva- 
lent biomass-based measurements to pro- 
vide context and discussion. 

We estimate that vegetation removal 
by oil and gas development from 2000 to 
2012 reduced NPP by ~4.5 Tg of carbon or 
10 Tg of dry biomass across central North 
America (see the chart on page 402, left). 
The total amount lost in rangelands is the 
equivalent of approximately five million 
animal unit months (AUM; the amount 
of forage required for one animal for 1 
month), which is more than half of annual 
available grazing on public lands managed 
by the U.S. Bureau of Land Management 
(BLM). The amount of biomass lost in 
croplands is the equivalent of 120.2 million 
bushels of wheat, ~6% of the wheat pro- 
duced in 2013 within the region and 13% 
of the wheat exported by the United States 
(see SM for equivalency calculations). 

The loss of NPP is likely long-lasting and 
potentially permanent, as recovery or rec- 
lamation of previously drilled land has not 
kept pace with accelerated drilling (SM). 
This is not surprising because current rec- 
lamation practices vary by land ownership 
and governing body, target only limited 
portions of the energy landscape, require 
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The number of oil and gas wells drilled within central provinces of Canada and central U.S. states 1900-2012. 
Canadian provinces: Alberta, Manitoba, and Saskatchewan. U.S. states: Colorado, Kansas, Montana, Nebraska, 
New Mexico, North Dakota, Oklahoma, South Dakota, Texas, Utah, and Wyoming. See SM. 


NPP began in 2000 and are produced an- 
nually; they capture interannual dynamics 
(9). To match the spatial scale of NPP mea- 
surement (~1 km’), we determined annual 
density of oil and gas activity at the same 
resolution and estimated annual loss of 
NPP relative to such densities. Direct loss 
of vegetation resulting from oil and gas 
activity was validated at medium and fine 
spatial scales (~250 m? and 30 m”, respec- 
tively) by examining vegetation and dis- 
turbance trends before and after drilling 
[see supplementary materials (SM)]. We 
categorized annual reductions in NPP rela- 
tive to land cover type (e.g., cropland and 
rangeland). As NPP is measured in grams 
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substantial funding and implementation 
commitments, and are often not initiated 
until the end of life of a well (JO). Barring 
changes from existing trends and practices, 
it is likely that NPP loss and its effects (i.e., 
further loss of forage) will continue to par- 
allel drilling trends and, potentially, may 
create unforeseen conflicts among agricul- 
ture, conservation, and energy. 

Additional ecosystem functions, includ- 
ing wildlife habitat and landscape connec- 
tivity, are arguably as important as NPP. 
We estimate that the land area occupied by 
well pads, roads, and storage facilities built 
from 2000 to 2012 is ~3 million ha, the 
equivalent land area of three Yellowstone 
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Oil and gas development impacts on ecosystem services 
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Cumulative impacts of oil and gas development on ecosystem services in central North America 2000-2012. (Left) Reduction in NPP (biomass), per land cover type. 


(Middle) Land area occupied, per land cover type. (Right) Number of wells in water-stressed regions (22). See SM. 


National Parks (see the chart, middle). Al- 
though small in comparison with the total 
land area of the continent, this important 
land use is not accounted for (17) and cre- 
ates additional pressures for conserving 
rangelands and their ecosystem functions. 
The distribution of this land area has nega- 
tive impacts: increasing fragmentation that 
can sever migratory pathways, alter wild- 
life behavior and mortality, and increase 
susceptibility to ecologically disruptive 
invasive species (6, 12). As competition for 
arable land intensifies because of food and 
bioenergy demand (/3), oil and gas may 
further expand into native rangelands. 

The hydraulic fracturing technology un- 
derlying the current expansion of oil and 
gas drilling in the region has profound im- 
plications for hydrological, water-quality, 
and water-use regimes. High-volume hy- 
draulic fracturing uses 8000 to 50,000 m? 
of water per well for the initial fracturing 
event (14), which results in 7187 to 33,903 
million m? for wells drilled across this re- 
gion during 2000 to 2012 (see SM). Nearly 
half of wells drilled in this time period 
occurred in already highly or extremely 
water-stressed regions (see the chart, 
right). As refracturing becomes more 
common to yield greater production, oil 
and gas development adds to an already 
fraught competition among agriculture, 
aquatic ecosystems, and municipalities for 
water resources, in addition to concerns of 
water quality (15). 


AVOIDING BROAD-SCALE LOSS. The capac- 
ity for insight into land-use decisions has 
improved substantially since the last major 
episode of widespread land-use change across 
the Great Plains. In the early 20th century, 
rapid agricultural expansion and widespread 
displacement of native vegetation reduced 
the resilience of the region to drought, ul- 
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timately contributing to the Dust Bowl of 
the 1930s. It took catastrophic disruption of 
livelihoods and economies to trigger policy 
reforms that addressed environmental and 
social risks of land-use change. 

Fortunately, data and information are 
now far less of a barrier in understanding 
and addressing continental and cumula- 
tive impacts. However, the scale and focus 
of most land-use decision-making discour- 
ages comprehensive assessment of trade- 
offs implied in oil and gas development (J6, 
17). Recent planning efforts by U.S. federal 
management agencies demonstrate poten- 
tial to balance demand for energy develop- 
ment with the need to protect other values 
(18, 19), but the scope is limited to lands 
under federal jurisdiction. About 90% of oil 
and gas infrastructure in this region occurs 
on private land (United States only; see 
SM). Provinces, states, and municipalities 
that permit the majority of oil and gas de- 
velopment lack the capacity and mandate 
to address continental or regional conse- 
quences that transcend political bound- 
aries; this lack leads to fragmented and 
piecemeal policies (16, 20). 

Decision-makers and scientists must 
work together to ensure that the best avail- 
able information guides development of 
policies at the water-energy-food nexus 
(21). Traditional laws and regulations may 
have limited application, as oil and gas 
can be exempt from key environmental 
regulations (20), or such regulations iso- 
late features of systems—e.g., a single spe- 
cies—while failing to capture interrelated 
impacts. Active synthesis and consolida- 
tion of data will improve accessibility and 
monitoring. Integration of these data into 
land-use planning and policy across scales 
and jurisdictions is necessary to achieve 
energy policies that minimize ecosystem 
service losses. 
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Red knots and other 
shorebirds - the 
final leg of their annual ad 
spring migration. 


COASTAL ECOLOGY 


Migration, interrupted 


The imperiled existence of the rufa red knot 


By Nigel A. Clark 


n order for the shorebirds known as red 

knots to survive their annual migration 

from the Southern Hemisphere to the 

species’ Arctic breeding grounds, they 

rely on a series of sites along the way 

to provide enough food for their stay 
and to fuel their onward flight. If just one 
of these sites is compromised, the species 
itself will fail. 

Unfortunately for the knots, many of 
these vital sites are vast, intertidal flats that 
humans have come to see as areas ripe for 
development and exploitation. This is es- 
pecially true for the main spring stopover 
sites that knot populations on each conti- 
nent rely on, including areas in the Dela- 
ware Bay in the Americas, the Wadden Sea 
in Europe, and the Yellow Sea in East Asia. 
The marvel of their long migration and 
the beauty of their synchronously weaving 
flocks have inspired many to try to ensure 
that there is a place on the planet for the 
knots for years to come. 

In The Narrow Edge, Deborah Cramer 
follows the migration route of the Ameri- 
can rufa knot from the southern tip of 
South America to the high Arctic, where 
the ground thaws just long enough for 
them to raise their chicks before winter sets 
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in again. The book is perhaps more about 
people than birds, as Cramer tries to under- 
stand how and why humans and birds have 
come into conflict. She explores what moti- 
vates developers, fishermen, and conserva- 
tionists and explores the ways scientists are 
working to understand how we can live to- 
gether on an increasingly crowded planet. 
Throughout the book, Cramer explores the 
history that has led to the tenuous existence 
of the knot, as well as the tangled ecological 
web that it, and we, are part of. 

The book is written from a conservation- 
ist’s viewpoint, but Cramer also tries to 
understand the driving forces that, often 
inadvertently, make it more difficult for the 
knots to survive. For example, she recounts 
how, until recently, the salt marshlands in 
Argentina’s Rio Gallegos estuary were being 
sold off to developers and used as a munici- 
pal waste dump. As the population of the 
nearby city expanded over the past half- 
century, the number of knots observed each 
year in the estuary dropped precipitously. 
Now, as a result of the efforts of scientists 
Silvia Ferrari and Carlos Albrieu, Rio Gal- 
legos has two protected areas where further 
development of the marsh is prohibited. 
The city has also made an asset of the 
marsh, creating a visitor center and bird- 
viewing areas to educate locals and tourists 
about the value of conservation efforts. This 
has come too late for the knots, which no 
longer stop there, but has proved vital for 


Published by AAAS 


The Narrow Edge | 
ATiny Bird, an Ancient The Mary tg 
Crab, and an Epic Journey 
Deborah Cramer 

Yale University Press, 2015. 
303 pp. 


many other species of shorebirds. Perhaps, 
in time, the knots will return. 

Many knots that migrate from the Ameri- 
cas make their last stop to refuel in the 
Delaware Bay before heading to the Arctic. 
Here, they gorge on horseshoe crab eggs 
that are washed out of the sand, where 
they are laid in May of each year. The crabs 
and, hence, the birds have declined over the 
past two decades. It would be easy to blame 
modern fishermen for overexploiting the 
crabs, but Cramer shows that the tendency 
toward aggressive harvesting is consistent 
with historical practices. During the mid- 
to late 1800s, horseshoe crabs were fed to 
pigs or ground into fertilizer in specially 
built factories. More than a million crabs 
were harvested from a single mile of beach 
in 1857, and over 4 million were taken from 
all of Delaware Bay in 1880. By the late 
1800s, the horseshoe crab population in 
the Delaware Bay was almost nonexistent. 
It slowly recovered over the next century 
but crashed again when fishermen began 
collecting the crabs to use as bait for conch 
and eel fishing in the 1990s. With a history 
of exploitation, it is not surprising that the 
fishermen saw the crabs as a resource to be 
taken advantage of, rather than a popula- 
tion to be managed sustainably. Since 1998, 
there have been progressively more severe 
harvest restrictions on the crabs but, with 
a 10-year life cycle, we are just beginning to 
see signs of recovery. 

If the horseshoe crab had gone extinct, 
the knots would not have been the only 
ones to suffer. In the early 1950s, the phy- 
sician Frederick Bang discovered that the 
crab’s blue blood clots in the presence of 
Gram-negative bacteria. By the mid-1980s, 
limulus amebocyte lysate (LAL), made from 
the blood of horseshoe crabs, had become 
an invaluable aid in the fight against infec- 
tion in humans. 

By focusing on the plight of one flagship 
species and the people who have dedicated 
their lives to understanding how to pro- 
tect it, this book shows how conservation 
efforts are critical to maintaining coastal 
biodiversity. It also offers important les- 
sons and strategies that may be imple- 
mented for the protection and preservation 
of other species. 
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Hard evidence 


Exploring the origins and evolution of forensic science 


By Andrew Robinson 


he rise of forensic science is said to 

date from the 19th century: the era 

of a reliable chemical test for arsenic 

poisoning, Michael Faraday’s expla- 

nation of combustion, the introduc- 

tion of fingerprinting, and Arthur 
Conan Doyle’s fictional detective, Sherlock 
Holmes. But its beginnings actually lie far 
earlier, as chronicled in the wide-ranging 
Wellcome exhibition Forensics: The Anat- 
omy of Crime. 

The exhibit includes a Chinese handbook 
for coroners, published in 1247, that is still 
regarded as one of the crucial volumes on 
postmortem methods. The Washing Away 
of Wrongs by Song Ci includes the absorb- 
ing case of a man stabbed to death by the 
roadside. After comparing the slash marks 
on the body with his own experimental 
slash marks on a cow’s carcass using vari- 
ous blades, the coroner concluded that the 
murder weapon was a sickle. He lined up all 
of the neighborhood’s 70 adults with their 
sickles—free of any obvious blood traces—at 
their feet. Within seconds, a fly landed on 
the local moneylender’s sickle, then another, 
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and another. According to the handbook, the 
moneylender now “knocked his head on the 
ground” and confessed to the murder. In the 
words of crime novelist Val McDermid, who 
wrote the gripping nonfiction book that ac- 
companies the Wellcome exhibition (1), he 
had been betrayed by “the insect informers 
humming quietly at his feet.” McDermid’s 
book is based on revealing interviews with 
practicing experts in all fields of forensic sci- 
ence, some of whom also feature in the exhi- 
bition’s video interviews. 

Violent crime and the motives of killers 
have always fascinated the public, even 
those who avoid crime fiction. Indeed, the 
English word morgue derives from the 
French word morguer, meaning roughly 
“to peer.” The Paris Morgue, an institution 
established in the early 1800s, was open to 
the public, who could peer through win- 
dows at unidentified corpses. The hope was 
that this would promote the identification 
of the unknown, but it quickly became a 
popular destination, attracting foreign visi- 
tors and even mothers with young children. 
A century later, the morgue was closed to 
idle gazers “out of concern for public moral- 
ity,” notes a caption that accompanies one 
of the illustrations on display. 

The exhibition necessarily eschews hu- 
man remains, except for some skulls and 
bones, including some that were exhumed 
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from mass graves during recent massacres. 
Yet there is plenty to instruct, discomfort, 
appall, and entertain the visitor. For ex- 
ample, in a deliberately dark corner of “The 
Morgue,” next to an illuminated ceramic 
postmortem table, one can don a pair of 
headphones and listen to the soundtrack of 
areal human autopsy recently conducted in 
murder-ridden Mexico. This part of the ex- 
hibition is not for the squeamish. 

One of the most thought-provoking ex- 
hibits is far from the most gruesome, al- 
though it does have something of the appeal 
of a Hitchcock horror film. At the start of 
the exhibition, one may peer into the rooms 
of what looks like a large doll’s house, in 
which a murder has been committed. The 
house is one of a series of models known 
as “Nutshell Studies of Unexplained Death,” 
created in the 1930s and 1940s by Frances 
Glessner Lee, the wealthy Chicago heiress 
who founded the Harvard School of Legal 
Medicine in 1931. Both police officers and 
crime writers (including Erle Stanley Gard- 
ner, the creator of the Perry Mason series 
of detective stories) have long used Nutshell 
models as a training tool. In 2012, they be- 
came the subject of a documentary film, Of 
Dolls and Murder (2), a clip from which is 
shown next to the model. Disquieting music 
from the film can be heard throughout the 
early portion of the exhibition. 

At the end, three near-life-sized con- 
temporary photographs by the artist Taryn 
Simon serve as a reminder of one of the ben- 
efits of forensic science. Shot as part of the 
Innocence Project, an initiative dedicated to 
exonerating wrongfully convicted individu- 
als and reforming the criminal justice sys- 
tem in the United States, each image shows 
a falsely imprisoned man posing after his 
release at the scene of his alleged crime. In 
the United States alone, more than 300 con- 
victed people have been exonerated by DNA 
evidence and released from incarceration 
since the organization’s inception in 1992. 
Without sensitive and sophisticated forensic 
investigations, there would likely be more 
such miscarriages of justice. 
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A path to recruiting 
more STEM teachers 


I STRONGLY DISAGREE with the implication 
in the News Feature “A classroom experi- 
ment” (J. Mervis, 6 February, p. 602) that 
recruiting new K-12 teachers in science, 
technology, engineering, and mathemat- 

ics (STEM) is futile and that efforts to 
improve education should instead focus on 
in-service teachers. For disciplines such as 
physics and chemistry, fewer than half of 
all classrooms are led by a teacher with a 
degree in the subject (7). Without efforts to 
improve recruitment, we cannot effectively 
address the severe shortage of physical sci- 
ence teachers with deep content knowledge. 
Teachers with deep content knowledge in 
their subject, along with mastery of content- 
specific pedagogical skills, are the most 
important factor in ensuring excellence in 
STEM education (2). 

The American Physical Society, American 
Association of Physics Teachers, and dozens 
of universities across the country have 
worked during the past 15 years to address 
the severe shortage of physics teachers 
through the Physics Teacher Education 
Coalition (PhysTEC) project. PhysTEC sites 
establish a positive climate for teaching 
in the physics department; offer experi- 
ences that expose students to the rewards 
and challenges of teaching; build effective 
pathways for students to earn a degree and 
certification; and host a master teacher who 
mentors students. The result: PhysTEC sites 
have more than doubled the number of 
graduates well prepared to teach physics. 

Moreover, in our experience, Noyce 
scholarships have provided critical financial 
support to PhysTEC students. Although 
scholarships alone are typically not enough 
to recruit new teachers, they are a key piece 
of the puzzle along with transforming disci- 
plinary departments. 

I was also troubled by the inaccurate 
statement that “5-year attrition rates rise 
to 50% or higher.” Five-year retention rates 
for new teachers are about 70% (3), and 
retention rates for PhysTEC teachers are 
even higher (4). Propagating unfounded 
statements that confirm negative stereo- 
types about K-12 teaching is damaging and 
undermines efforts to improve it. 

Monica Plisch 
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Editor’s note 


LAST OCTOBER, SCIENCE published an 
investigation into a dispute between 
physicist Ulf Leonhardt and the Centre 
for Optical and Electromagnetic Research 
(COER) at South China Normal University 
(SCNU) in Guangzhou (“Show me the 
money,” 24 October 2014, p. 411). The 
story centered on Leonhardt’s recruitment 
by COER as a part-time distinguished 
professor, funded by generous grants 
from the Chinese Central and Guangdong 
governments, and the premature end of 
his contract, when he charged that COER 
was not transparent about the sources of 
money and was spending it on research 
he did not know about or authorize. 
COER, for its part, accuses Leonhardt 

of bad faith and breach of contract. The 
dispute is acrimonious and ongoing. 

Our goal in writing about the dispute 
was to illuminate the potential for misun- 
derstanding and conflict inherent in these 
arrangements, which have become more 
common as China’s science develops. The 
allegations reported in the article reflect 
the positions of the respective parties, and 
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Science does not endorse them. We regret 
any implication to the contrary. 

Since the article appeared, we have 
learned that it contains several minor 
inaccuracies concerning the timing of 
events and the details of the grants and 
contracts. We have corrected these in the 
online version of the story. We have also 
clarified other language and removed 
an illustration that was open to misin- 
terpretation. Since publishing the story, 
we have learned that before Leonhardt 
arrived in China, he received a draft of a 
presentation describing research planned 
for the Guangdong Leading Talent proj- 
ect. Leonhardt also took part in an oral 
defense in Beijing on March 21st, 2012, 
for the Guangdong Leading Talent project 
application. He and COER continue 
to disagree about how much advance 
knowledge he could have had about the 
research funded by his grants. 

Tim Appenzeller 
News Editor 


A Chinese physics 
institute's defense 


“SHOW ME THE money’(M. Hvistendahl, 
News Feature, 24 October 2014, p.411) 
contains numerous false allegations, 
misinterpretations, and bias, which 

we must address. 

Leonhardt claims he was unaware of 
the grant application contents. However, 
Leonhardt had a chance to revise a draft 
of the slides for the Guangdong Leading 
Talent Project and presented the slides to 
the committee. They detail the projects on 
transformation optics-based super- 
resolution imaging, cloaking, and absorb- 
ers for which funding was sought. Thus, 
Leonhardt should have been aware of 
the contents of the grant application. 
Casimir forces are briefly mentioned in 
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the Thousand Talents application because 
Leonhardt provided his Casimir force work 
(1) in a package of his selected papers 

when he asked the Centre for Optical and 
Electromagnetic Research (COER) to pre- 
pare the application for him. At that time 
no COER member worked on Casimir forces. 
Leonhardt claims he did not suggest Casimir 
forces until one year after August 2011, 
whereas the article claims that “Leonhardt 
planned to work on the theory of Casimir 
forces,’ which is self-contradictory. 

The document provided to Science by 
Professor Jun Li explicitly connects the team 
slide (including potential collaborators) pic- 
tured in the article to the research directions 
of the Leading Talent defense, which did not 
include the Casimir force. 

Our finances have been strictly in accor- 
dance with the approved budget and grant 
regulations. 790,043 RMB ($128,462) was 
spent on research equipment; 208,920 RMB 
($33,971) was spent on travel for Leonhardt, 
his partner Jana Silberg, and their invited 
guests; and the unused 4,001,037 RMB 
($650,575) was returned to the grant author- 
ity. Normal salary is taxed and transferable 
abroad, whereas tax-free subsidies are 
intended to be used in China. Leonhardt 
made several unsuccessful attempts to 


transfer subsidy money abroad while in 
Guangzhou from 12 to 18 January 2013. On 
28 January 2013, Leonhardt e-mailed to 
COER, “Concerning Jana’s payment, please 
arrange for a regular salary and give me 
the account number where I can return the 
0.5M.” This return would not have affected 
the net income Leonhardt would have 
received if he completed his 5-year con- 
tract. This 500,000 RMB ($81,301) remains 
frozen in a South China Normal University 
(SCNU) bank account awaiting government 
instruction. 1,566,667 RMB ($254,743) in 
pre-tax salary and housing subsidy remains 
in Leonhardt’s possession, and he shows 
no intention of returning any of it. Since 
Leonhardt was in mainland China for only 
57 days while under contract and did not 
fulfill the contract’s terms, SCNU is taking 
legal action to recover a substantial portion 
of this money. 

While we appreciate the over 20 
changes, the modified article online 
still misrepresents our center’s voice, 
most notably Professor Sailing He’s quotes. 
They are out of context and based on the 
journalist’s notes, which He did not have a 
chance to review and sign. When He said 
that Leonhardt “doesn’t need to care about 
the details,” he was referring to whether 


the income was categorized as subsidies 
or salary. 

As noted in the article, Leonhardt has 
waged a campaign through many Chinese 
agencies to defame COER. No Chinese 
agency has responded, because COER did 
nothing wrong and Leonhardt’s allegations 
are groundless. 

Chinese culture values trust between peo- 
ple. Unfortunately, Leonhardt betrayed our 
trust. We have learned some hard lessons 
and proposed reforms to related government 
agencies. SCNU has always strictly imple- 
mented relevant provisions of the national 
Thousand Talents Program and Guangdong 
Leading Talent Project, accepted supervi- 
sion from upper-level departments, and 
thanks the community for support. SCNU 
will continue to improve management and 
service, and always welcome highly talented 
individuals who actively promote teaching 
and research development at the University. 

Langping He 

Deputy Dean, Centre for Optical and 
Electromagnetic Research and Academy of 
Advanced Optoelectronics, South China Normal 


University, Guangzhou, Guangdong, 510006, China. 
E-mail: lpbhe@scnu.edu.cn 
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Documenting 
border conflicts 
from afar 


Civilian researchers find early 
signs of cross-border violence 
in satellite images 


By Ginger Pinholster 


y early April 2014, despite denials by 

Russian government officials, signs of 

a coordinated Russian military cam- 

paign to annex Ukraine’s Crimean 

Peninsula were falling into place. On 

10 April, NATO released commercial 
satellite images showing a buildup of Rus- 
sian forces at Novocherkassk and other 
towns along the border with Ukraine. A Rus- 
sian military official claimed that the photos 
showed military drills from August 2013, but 
AAAS analysis of satellite images from the 
same time period and locations reinforced 
NATO’s case. 

Several weeks earlier, that AAAS research 
effort had also confirmed reports of military 
activity in the Ukrainian port city of Sevas- 
topol. The satellite imagery showed, for ex- 
ample, military vehicles parked in depots 
throughout the area and a tugboat towing 
a floating chain to control access to the en- 
trance of Sevastopol Bay. 

Civilian researchers have long used satel- 
lite images to investigate threats to human 
rights and the environment, but the AAAS 
study, which was part of a broader effort to 
document cross-border conflicts, represents 
a new direction for this type of analysis, said 
Susan Wolfinbarger, director of the AAAS 
Geospatial Technologies Project. In addition 
to revealing destruction to cultural sites or 
natural resources, satellite images can help 
“pierce the fog of war” by confirming or over- 
turning reports about what is happening on 
the ground during conflicts, said Jonathan 
Drake, a AAAS senior program associate. 

This research is “hitting at a huge and very 
important question,’ said Noel Dickover, se- 
nior program officer at the PeaceTech Lab, 
which is affiliated with the United States In- 
stitute of Peace (USIP): “What if peace-build- 
ers had access to geospatial information in 
near-real time to look at incidents of violent 
conflicts before they arise? How would that 
change our world?” 

Wolfinbarger and her colleagues spoke at 
an 11 March event at USIP, which funded 
seven cross-border conflict studies by 
AAAS. The researchers compared images 
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from before and during the escalation of 
the conflicts, which took place in regions 
roiled by the end of colonialism in Africa 
and South/Southeast Asia, and by the fall of 
the Soviet Union. Together, these images re- 
vealed key warning signs of impending war: 
the movement of materials and troops, as 
well as the appearance of new infrastructure 
and military camps. 

Evidence of troop movements could be 
seen in satellite images from the majority 
of the conflicts studied, said AAAS Program 
Associate Eric Ashcroft. In Africa, for ex- 
ample, a new road with access to the dis- 
puted border between Djibouti and Eritrea 
seemed to signal the buildup of military 
activity before a 2008 conflict. Military mo- 
bilization of troops was more directly vis- 
ible before the 2001-02 conflict between 
India and Pakistan. The sudden emergence 
of new infrastructure also marked the start 
of a conflict at the Cambodia-Thailand 
border in 2008. And, shortly after a 2011 
referendum made South Sudan an indepen- 
dent state, “you could see a large amount 
of materiel moving through the space, pre- 
sumably to the front lines” at the Sudanese 
border, Ashcroft said. 

For now, the high-resolution satellite im- 
ages needed for this type of analysis aren’t 
easily available. The first commercial satel- 
lite was launched in 1999, and now there 
are eight offering image resolutions of one 
meter or less. Costs can be prohibitive, how- 
ever: a single, small image can cost $250 or 
more, and to analyze each location requires 
(at minimum) an image captured before and 
after a conflict or event of interest. Commer- 
cial satellites also do not acquire images of 
the whole globe continuously. 

In the future, the emergence of smaller 
“microsatellites” that continuously scan 
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A March 2014 influx of helicopters at Russia’s 
=ys _Kacha Airbase in Crimea signaled trouble ahead. 


the planet may potentially support more 
automated analysis of regions over time, 
Ashcroft noted. Those images are consis- 
tently captured from a 90-degree angle, 
he explained, and so they can be precisely 
aligned on top of each other. 

As more images become available for 
civilian analysis, said Dickover, “you can 
envision scenarios in the very near future 
where ad hoc groups would have the abil- 
ity to crowd-fund the cost of the imagery, 
and you already see in the humanitarian 
space where they are doing the analysis 
themselves. Imagine if that happens to the 
peace-building world...the implications are 
fairly vast.” 

The AAAS Geospatial Technologies Proj- 
ect has posted its seven case studies and 
other instructional documents for research- 
ers who want to leverage satellite-image 
analysis, at http://aaas.org/geotech/borders. 
The group has also published an ethics 
statement about satellite imaging in cul- 
tural sites of conflict. m 


AAAS launches new Public 

Engagement Institute 

The newly announced Alan I. Leshner Leadership 
Institute for Public Engagement will empower 
cohorts of scientists and engineers to commu- 
nicate effectively about key issues at the inter- 
section of science and society. As of 10 April 
2015, more than 130 gifts and pledges had 
been committed, exceeding the initial goal 
of $500,000 to fund the Institute in its first 5 
years. The first class of fellows, planned to con- 
vene in 2016, will focus on climate change, 
with subsequent topics to be determined by a 
AAAS advisory committee. Give online at www. 
supportaaas.org/AlanLeshnerFund. 
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SUPERNOVAE 


Zap the dust away? 
Not so fast 


t would be great to clean up 
dust by vaporizing it, but even 
powerful blasting by an explod- 
ing star doesn't seem capable of 
this. Supernovae produce vast 
amounts of dust, but it is a mystery 
how that dust survives ensuing 
hostile conditions. Lau et a/. used 
the SOFIA telescope to observe the 
dust associated with the supernova 
remnant Sgr A East. They found 
that this dust had endured far lon- 
ger than expected, which indicates 
that dust in the universe's oldest 
galaxies can also be attributed to 


The supernova re 
Sagittarius A Ea 
surrounded by di 


What controls 
clumped isotopes? 


Stable isotopes of a molecule 
can clump together in several 
combinations, depending on their 
mass. Even for simple molecules 
such as O,, which can contain 60, 
YO, and #80 in various combina- 
tions, clumped isotopes can 
potentially reveal the tempera- 
tures at which molecules form. 
Away from equilibrium, however, 
the pattern of clumped isotopes 
may reflect a complex array of 
processes. Using high-resolution 
gas-phase mass spectrometry, 
Yeung et al. found that biological 
factors influence the clumped 
isotope signature of oxygen 
produced during photosynthesis 
(see the Perspective by Passey). 
Similarly, Wang et al. showed that 
away from equilibrium, kinetic 
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effects causing isotope clumping 
can lead to overestimation of the 
temperature at which microbially 
produced methane forms. — NW 


Science, this issue p. 431; p. 428; 
see also p.394 


Ebola virus 
vaccine candidate 


As there are so few possibilities 
for drugs and vaccines that 
protect against the Ebola virus 
(EBOV), we need more options. 
Marzi et al. present initial studies 
in monkeys of a promising whole 
Ebola virus vaccine based on 
a defective form of the virus in 
which an essential viral gene 
(VP30) is knocked out. One or 
two doses of this defective virus, 
with or without further peroxide 
inactivation, protected against 
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supernovae. — MMM 


a lethal challenge of EBOV. In 
limited immunological analyses, 
protection correlated with the 
production of antibodies to the 
EBOV envelope. — CA 

Science, this issue p. 439 


Generating homozygous 
mutations 


Loss-of-function mutations may 
only produce a mutant pheno- 
type when both copies of the 
gene are mutated. Gantz and 
Bier developed a method they 
call mutagenic chain reaction 
(MCR) that autocatalytically pro- 
duces homozygous mutations. 
CR uses the initial mutated 
allele to cause a mutation in 
the allele on the opposing 
chromosome and thus the 
homozygosity of the trait. 


CR 
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technology could have broad 
applications in diverse organ- 
isms. — BAP 


Science, this issue p. 442 


A genetic cause for 


severe influenza 


Although chicken soup and 
plenty of rest get most kids 
through an influenza virus 
infection, some require hos- 
pitalization. Ciancanelli et al. 
report on one child who suffered 
severely from influenza because 
of null mutations in the gene 

for transcription factor IRF7. 
Cells isolated from this patient 
could not make enough secreted 
antiviral proteins, called inter- 
ferons, to halt viral replication. 
The requirement for IRF7 seems 
quite specific, because this 
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patient recovers normally from 
other common childhood viral 
infections. — KLM 

Science, this issue p. 448 


BIOENGINEERING 
Drug testing in a patient’s 
own tumor cells 


It is not easy to predict a 
patient’s response to chemo- 
therapy with animal models and 
cultured cells. The true test of 
responsiveness requires the 
evaluation of drug activity within 
human tumors. In two studies, 
Jonas et al. and Klinghoffer et al. 
have engineered devices that 
deliver microdoses of drugs 
directly into tumors. After 
treatment, the researchers 
removed the tumor tissue and 
quantified cancer-cell death. In 
both studies, the local response 
to chemotherapy matched the 
systemic response to known, 

as well as experimental, drugs. 
However, drug sensitivities were 
revealed that were not detected 
in cell culture. — MLF 


Sci. Transl. Med. 7,284ra57 
and 284ra58 (2015). 


PROTEIN FOLDING 
Ribosomes help 
careful protein folding 


Protein assembly in vitro is use- 
ful for studying small molecules 
but is problematic for studying 
the assembly of larger, more 
complex proteins. Kim et al. 
analyzed the biogenesis of the 
mutation-prone nucleotide- 
binding domain of the cystic 
fibrosis conductance regulator 
(CFTR) (see the Perspective 

by Puglisi). Newly synthesized 
polypeptides emerged relatively 
slowly from the ribosome and 
folded through a modulated 
pathway that ensured correct 
protein folding. Some parts of 
the protein chain folded 
immediately upon 
synthesis, whereas 
other segments did 
so more slowly. 

It appears that 
acquiring the cor- 
rect conformation 
for this complex 
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protein is partly guided by the 
ribosome itself. — SMH 


Science, this issue p. 444; 
see also p. 399 


CARDIAC PHYSIOLOGY 
Keeping hearts 
at the right size 


If left untreated, high blood 
pressure can lead to abnormally 
enlarged hearts (a condition 
called pathological hypertrophy) 
and heart failure. Inhibitors of pro- 
tein kinase C (PKC) isoforms are 
in development for treating heart 
failure and some cancers. Withal 
et al. report that the inhibition of 
some PKC isoforms may exacer- 
bate heart pathology. During 
development, mice lacking two 
related PKC isoforms, PKC6 
and PKCe, had abnormally large 
hearts and usually died in utero. 
Thus, drugs that inhibit PKCS 
and PKCe could trigger adverse 
cardiac side effects. — WW 

Sci. Signal. 8, ra39 (2015). 


POLYMER CHEMISTRY 
Amore direct way 
to synthesize styrene 


Foam cups, foam pellets, 
plastic cutlery: All are made of 
polystyrene, which in turn is 
made of styrene. The massive 
manufacturing scale of this 
commodity chemical places a 
premium on the efficiency of its 
synthesis. The current industrial 
route requires three steps to 
make styrene from benzene and 
ethylene. Vaughan et al. present 
arhodium catalyst that achieves 
the coupling in a single step by 
using a recyclable copper salt as 
an oxidant. Although the catalyst 
is Slow for industrial application, 
it demonstrates the viability of a 
more direct process. — JSY 
Science, this issue p. 421 


Polystyrene 
pellets 


IN OTHER JOURNALS 


The structured surface 
of a leaf beetle carapace 
diffracts light to produce 
spectacular colors 


APPLIED OPTICS 


Edited by Sacha Vignieri 
and Jesse Smith 


Astretch to change color 


he reflection of white light from structured surfaces 

often results in a spectacular display of color as the 

white light is split into its different wavelengths through 

diffraction. Structure gives rise to the intense iridescent 

colors that distinguish some members of the animal 
kingdom, such as beetles and butterflies. Human-made 
materials, such as DVD or CD surfaces, also diffract light into 
a rainbow. Zhu et al. combine surface structure with membrane 
flexibility to show that they can locally select the color of 
reflected light, as they stretch the membrane and change the 
periodicity of the structure. This technique could be used ina 
range of applications, including camouflage coatings, optical 
sensing and steering, and displays. — ISO 


CELLULAR BIOMECHANICS 
The mechanics of 
cellular left and right 


Cells need to know their own left 
and right in order to coordinate 
with neighboring cells in collec- 
tive movement or embryonic 
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Optica 2, 255 (2015). 


development. To do so, each 

cell has to establish left/right 
asymmetry. Tee et al. studied 
actin organization in human 
cells to understand underlying 
mechanisms, using fluorescence 
and electron microscopy and 
simulations. Actin fibers forming 
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GALAXY EVOLUTION 
Galaxies stripped 
down and evicted 


It is easy to imagine that the 
relatively small and dense com- 
pact elliptical galaxies once had 
more to them. Especially when 
massive galactic neighbors 

are seen nearby that may have 
robbed them of their matter, but 
sometimes there are no such 
neighbors. Chilingarian and 
Zolotukhin have mined survey 
data to show that stripped-down 
galaxies are found in varied 
environments, with and without 
obvious disruptive companions. 
These isolated galaxies may still 
have been tidally stripped and 
then ejected from more crowded 
neighborhoods. — MMM 


Science, this issue p. 418 


RNA IMAGING 


Multiplexed RNA 
imaging in single cells 
The basis of cellular function 
is where and when proteins 
are expressed and in what 
quantities. Single-molecule 
fluorescence in situ hybridiza- 
tion (SmFISH) experiments 
quantify the copy number and 
location of mRNA molecules; 
however, the numbers of RNA 
species that can be simultane- 
ously measured by smFISH has 
been limited. Using combinato- 
rial labeling with error-robust 
encoding schemes, Chen et al. 
simultaneously imaged 100 to 
1000 RNA species in a single 
cell. Such large-scale detection 
allows regulatory interactions 
to be analyzed at the transcrip- 
tome scale. — VV 

Science, this issue p. 412 


SELF-ASSEMBLY 
Creating unusual 
nanostructures 


Self-assembly often occurs 
when dissimilar molecular frag- 
ments are forced together by 
covalent bonding. Surfactants 


410-B 


or block copolymers are two 
common examples. Huang et al. 
grafted four different nanopar- 
ticles, based on polyhedral 
oligomeric silsesquioxanes 

with slightly different composi- 
tions, onto a single tetrahedal 
core (see the Perspective by 
Yang). Depending on the type of 
nanoparticle, they assembled 
into a range of defined, ordered 
supramolecular lattices similar 
to arange of metal alloys. These 
include phases that have higher 
coordination numbers than 
usually found in the packing of 
spherical objects. — MSL 


Science, this issue p. 424; 
see also p.396 


RESEARCH FUNDING 
Proof that peer review 
picks promising proposals 
A key issue in the economics 
of science is finding effective 
mechanisms for innovation. 
A concern about research 
grants and other research and 
development subsidies is that 
the public sector may make poor 
decisions about which projects 
to fund. Despite its importance, 
especially for the advancement 
of basic and early-stage science, 
there is currently no large-scale 
empirical evidence on how 
successfully governments 
select research investments. 
Li and Agha analyze more 
than 130,000 grants funded 
by the U.S. National Institutes 
of Health during 1980-2008 
and find clear benefits of peer 
evaluations, particularly for 
distinguishing high-impact 
potential among the most com- 
petitive applications. — BW 
Science, this issue p. 434 


NEUROBIOLOGY 


SARM1-driven axon 
degeneration 


Axons, the long protrusions of 
nerve cells, are programmed 
to self-destruct under certain 
conditions that occur during 
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development, stress, or disease 
states. Gerdts et al. outline a 
biochemical mechanism that 
controls such axon degen- 
eration. The authors designed 
versions of SARM1 (sterile alpha 
and TIR motif—constraining 1) 
that could be activated or inhib- 
ited in cells. Their experiments 
showed that the activation of 
SARMI1 was necessary and suf- 
ficient to cause axon destruction 
in cultured mouse neurons. 
SARM1-mediated destruction 
was associated with depletion 
of the metabolic cofactor NAD* 
from cells. — LBR 

Science, this issue p. 453 


RIBOSOME 
Force to unblock 
a clogged ribosome 


The synthesis of proteins from 
mRNA by the ribosome is highly 
regulated. But newly synthesized 
protein chains can still block the 
ribosome exit tunnel and slow 
protein synthesis. Goldman et 
al. use optical tweezers to show 
that by pulling on the stuck 
protein chain, they can unblock 
a clogged exit tunnel (see the 
Perspective by Puglisi). In vivo, 
the folding of a nascent protein 
chain just outside the tunnel 
also generates enough force to 
unclog a block, indicating that 
ribosome-peptide interactions 
fine-tune protein synthesis. 
—GR 

Science, this issue p. 457; 

see also p.399 


VASCULAR DISEASE 
Matrikine regulates 
endothelial leakage 


The matrikine acetyl-proline- 
glycine-proline (N-a-PGP) regu- 
lates endothelial permeability in 
inflammatory disease. Xu et al. 
probed the loss of barrier func- 
tion of the endothelium, which 
is characteristic of inflamma- 
tion. Leakiness is stimulated by 
a signaling cascade initiated by 
the modified peptide N-a-PGP 


Published by AAAS 


ie) 


nd involving CXCR2. N-a-PGP is 
an extracellular matrix fragment 
generated by tissue damage. 
-a-PGP induces vascular leaks, 
and its removal attenuates the 
lipopolysaccharide-induced leak. 
-a-PGP is a novel matrikine 
and could be a new therapeutic 
target. — PLY 


Sci. Adv. 10.1126/ 
sciadv.1500175 (2015). 


EVOLUTIONARY BIOLOGY 
Symbionts on the road 
to a single species 


Most species of plants and 
animals live in symbiotic rela- 
tionships with other organisms. 
For example, many insects 

rely on bacteria within their 
cells—endosymbionts—to 
supply nutrients, and pea and 
bean plants have essential 
relationships with bacteria called 
rhizobia to fix atmospheric 
nitrogen. Why do some sym- 
bioses progress to ever-tighter 
partnerships and ultimately 
form a single new organism? In 
a Perspective, Kiers and West 
explore the conditions that favor 
such major evolutionary transi- 
tions in individuality. One key 
factor is the mode of symbiont 
transmission; another is the eco- 
logical context of the symbiosis. 
Once a partnership has reached 
a level of high dependence and 
low conflict, genetic drift alone 
can lead to further integration. 
However, major transitions of 
this kind require strict condi- 
tions to be fulfilled and are thus 
rare. — JFU 


Science, this issue p. 392 


MULTILAYER ASSEMBLY 
Thin-film fabrication 


The deposition of thin films from 
multiple materials is essential 

to arange of materials fabrica- 
tion processes. Layer-by-layer 
processes involve the sequen- 
tial deposition of two or more 
materials that physically bond 
together. Richardson et al. 


sciencemag.org SCIENCE 


review some of the techniques 
and materials that are used 

to make thin films, including 
sequential dip coating, spraying, 
and electrochemical deposi- 
tion. Despite the versatility of 
the methods and the range of 
materials that can be deposited, 
the techniques remain mostly 
confined to the lab because of 
challenges in industrial scaling. 
But because there is tremen- 
dous scope for fine-tuning the 
structure and properties of the 
multilayers, there is interest in 
broadening the use of these 
techniques. — MSL 


Science, this issue p. 411 
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REVIEW SUMMARY 


MULTILAYER ASSEMBLY 


Technology-driven layer-by-layer 
assembly of nanofilms 


Joseph J. Richardson, Mattias Bj6rnmalm, Frank Caruso* 


BACKGROUND: Over the past few decades, 
layer-by-layer (LbL) assembly of thin films 
has been of considerable interest because of 
its ability to exert nanometer control over 
film thickness and its extensive choice of 

usable materials for coat- 
ing planar and particulate 
Read the full article Substrates. The choice of 
at http://dx.doi. materials allows for re- 
org/10.1126/ sponsive and functional 
science.aaa2491 thin films to be engineered 
si ae ie Aa nee oe various ‘anelleations 
including catalysis, optics, energy, membranes, 
and biomedicine. Furthermore, there is now a 
growing realization that the assembly tech- 
nologies substantially affect the physicochem- 
ical properties and, ultimately, the performance 
of the thin films. 


Ge materials 
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ADVANCES: Recent advances in LbL as- 
sembly technologies have explored differ- 
ent driving forces for the assembly process 
when compared with the diffusion-driven 
kinetics of classical LbL assembly, where a 
substrate is immersed in a polymer solution. 
Examples of different assembly technologies 
that are now available include: dipping, de- 
wetting, roll-to-roll, centrifugation, cream- 
ing, calculated-saturation, immobilization, 
spinning, high gravity, spraying, atomization, 
electrodeposition, magnetic assembly, electro- 
coupling, filtration, microfluidics, and fluidized 
beds. These technologies can be condensed 
into five broad categories to which automa- 
tion or robotics can also be applied—namely, 
@) immersive, (ii) spin, (ii) spray, (iv) electro- 
magnetic, and (v) fluidic assembly. Many of 
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these technologies are still new and are ac- 
tively being explored, with research shedding 
light on how the deposition technologies and 
the underlying driving forces affect the for- 
mation, properties, and performance of the 
films, as well as the ease, yield, and scale of 
the processing. 


OUTLOOK: Layer-by-layer assembly has 
proven markedly powerful over the past two 
decades and has had a profound interdis- 
ciplinary effect on scientific research. Scal- 
ing up the process is crucial for furthering 
real-world applications, and moving forward, 
an understanding of how to carefully select 
assembly methods to harness the specific 
strengths of different technologies has the 
potential to be transformative. Comprehen- 
sive comparisons between the technologies 
still need to be conducted, especially in re- 
gard to coating particulate substrates, where 
comparisons are limited but crucial for ad- 
vancing fundamental research and practical 
applications. & 
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Nanofilm 
materials 


Unique 
applications 
as functional 
materials 


Layer-by-layer assembly of nanofilms for preparing functional materials. The properties and performance of the resulting films depend on the 
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MULTILAYER ASSEMBLY 


Technology-driven layer-by-layer 
assembly of nanofilms 


Joseph J. Richardson, Mattias Bjérnmalm, Frank Caruso* 


Multilayer thin films have garnered intense scientific interest due to their potential 
application in diverse fields such as catalysis, optics, energy, membranes, and biomedicine. 
Here we review the current technologies for multilayer thin-film deposition using 
layer-by-layer assembly, and we discuss the different properties and applications arising 
from the technologies. We highlight five distinct routes of assembly—immersive, spin, 
spray, electromagnetic, and fluidic assembly—each of which offers material and processing 
advantages for assembling layer-by-layer films. Each technology encompasses numerous 
innovations for automating and improving layering, which is important for research and 
industrial applications. Furthermore, we discuss how judicious choice of the assembly 
technology enables the engineering of thin films with tailor-made physicochemical properties, 
such as distinct-layer stratification, controlled roughness, and highly ordered packing. 


he performance of functional materials is 

governed by their ability to interact with 

surrounding environments in a well-defined 

and controlled manner. Whether harness- 

ing photons or electrons, separating out gas 
molecules or solutes, or responding to biomol- 
ecules or organisms, the environment-material 
interface is essential in determining the perform- 
ance of the materials in various applications. 
Coating technologies provide the means to con- 
trol the surface of a material, thus creating com- 
posite materials where the interface and the bulk 
of the material can, to a large extent, be engineered 
and controlled independently. 

Layer-by-layer (LbL) assembly is a prevalent 
method for coating substrates with functional 
thin films. Following early studies that reported 
multilayer assembly (J, 2), it is only in the past 
two decades that the field has witnessed con- 
siderable growth (3). Generally, LbL assembly is 
a cyclical process in which a charged material 
is adsorbed onto a substrate, and after washing, 
an oppositely charged material is adsorbed on 
top of the first layer. This constitutes a single bi- 
layer with a thickness generally on the order 
of nanometers, and the deposition process can 
then be repeated until a multilayer film of desired 
thickness has been assembled (3). For certain 
applications the substrate can then be removed, 
yielding freestanding macroscopic films, such as 
membranes (4), or freestanding micro- or nano- 
scopic films, such as hollow capsules (5, 6). Al- 
though electrostatic interactions remain widely 
used in facilitating formation of the films, other 
molecular interactions (e.g., covalent, hydrogen- 
bonding, host-guest) are now well established 
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for LbL assembly, with diverse materials (e.g., 
polymers, proteins, lipids, nucleic acids, nanopar- 
ticles, suprastructures) used as film constituents 
(7). The simplicity, versatility, and nanoscale con- 
trol that LbL assembly provides make it one of 
the most widely used technologies for coating 
both planar and particulate substrates in a di- 
verse range of fields, including optics, energy, 
catalysis, separations, and biomedicine (Fig. 1A). 

The widespread use of LbL assembly in fields 
with different standard tools and procedures— 
as well as the different processing requirements 
associated with substrates such as porous mem- 
branes, particles, and biological matter—has led 
to the development of a number of LbL assem- 
bly technologies. Examples include dipping (2), 
dewetting (8), roll-to-roll (9), centrifugation (J0), 
creaming (1D), calculated saturation (72), immo- 
bilization (13), spinning (14), high gravity (5), 
spraying (16), atomization (17), electrodeposition 
(18), magnetic assembly (19), electrocoupling (20), 
filtration (21), fluidics (22), and fluidized beds 
(23). These different methods have often been 
treated as “black boxes,” where the main focus 
has been on what materials are used (the input) 
for assembling the thin films (the output), with 
little focus placed on the actual assembly meth- 
od. However, there is now a growing realization 
that the assembly method not only determines 
the process properties (such as the time, scalabil- 
ity, and manual intervention) but also directly 
affects the physicochemical properties of the films 
(such as the thickness, homogeneity, and inter- 
and intralayer film organization), with both sets 
of properties linked to application-specific per- 
formance (Fig. 1B). 


Unpacking the “black box” 


The basis of LbL assembly is the sequential ex- 
posure of a substrate to the materials that will 
compose the multilayer films. The assembly tech- 


nologies used to assemble such films form five 
distinct categories, namely: (i) immersive, (ii) spin, 
(iii) spray, (iv) electromagnetic, and (v) fluidic 
assembly (Fig. 2). These assembly technologies 
affect both the process properties and the re- 
sultant material properties (Table 1), and there- 
fore careful choice of the assembly method can 
be crucial for successful application of the assem- 
bled films. Furthermore, two main themes can be 
identified for current developments in assembly 
technologies: The first is the move away from 
random diffusion-driven kinetics for layer deposi- 
tion, and the second is the advancement from 
manual assembly toward automated systems. 


Immersive assembly 


Immersive LbL assembly, sometimes referred to 
as “dip assembly,” is the most widely used meth- 
od and the standard that newer technologies are 
often compared against. Immersive assembly is 
typically performed by manually immersing a 
planar substrate into a solution of the desired 
material (2, 24, 25), followed by three washing 
steps to remove unbound material (26). Partic- 
ulate substrates can also be layered using immer- 
sion; however, the washing and deposition steps 
are generally broken up by centrifugation to pellet 
the particles (5, 6, 10). Early studies on using par- 
ticles for depositing planar multilayers noted 
that, theoretically, any material capable of having 
a surface charge (such as metals, nonmetals, or- 
ganics, and inorganics) could be applied for grow- 
ing multilayers if suitable conditions are used 
(2, 24, 27). Further, it was also reported that the 
thickness of each layer corresponds to the thick- 
ness of the particles being adsorbed (24, 28). Im- 
mersive assembly allows for more homogenous 
films [when using either particle (27) or polymer 
multilayers (3)] in comparison with non-LbL as- 
sembly technologies such as gas deposition and 
nucleation deposition, making LbL assembly 
widely used for thin-film formation. 
Improvements in immersive assembly include 
speeding up the process by shifting the deposi- 
tion kinetics away from random diffusion toward 
faster kinetics, such as those arising from dewetting 
(8), and by automating labor-intensive steps with 
robotic immersion machines (24, 26, 29, 30). The 
colloids used for planar assembly in early studies 
required only 1 min of immersion for each ad- 
sorption step (3D; however, for immersive as- 
sembly using polymers, the substrate is ideally 
immersed for ~15 min for sufficient layer depo- 
sition (25, 26). To reduce the assembly time for 
polymers and to allow for the deposition of low- 
surface charge and/or small-contact area mate- 
rials, solutions doped with organic solvents (e.g., 
dimethylformamide) can be used to eliminate 
the need for rinsing and drying steps through 
the process of dewetting (8). Dewetting leads to 
a ~30-fold reduction in assembly time because 
the adsorption process is no longer governed 
by diffusion but by evaporation and dewetting. 
Another interesting move away from random 
diffusion utilizes polymer solutions that are con- 
stantly stirred by magnetic stirrer bars, which 
allows for robust layers to be deposited within 
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tens of seconds after immersion (32). Instead of 
speeding up the adsorption process by using 
different adsorption kinetics, handling times 
can be decreased by automating the process 
(9, 24, 26, 27, 29, 30, 33). One approach to auto- 
mation uses a quartz crystal microbalance (QCM) 
as a substrate, allowing for layering to be con- 
trolled with a computer-monitored feedback 
loop (30). QCM enables the layering process to 
be based on a fixed mass of adsorbed material 
rather than fixed immersion times. Further- 
more, the feedback loop allows for precise and 
reproducible control over the film growth and 
allows for linear film growth to be engineered 
from polymer combinations that give nonlinear 
film growth using fixed times (30). For fixed- 
time immersive assembly, computer-programmed 
automated slide stainers can be retrofitted for 
automated multilayer assembly, allowing for 
agitation and solution exchange during washing 
steps (24, 29). A similar, although custom-built, 
computer-programmed machine can deposit 
~1000 layers of charged colloids onto particulate 
substrates (substrates ~100 um in diameter) (27). 

Although automation decreases manual in- 
volvement, it does not substantially reduce the 
overall assembly time, which is why some efforts 
have focused on combining faster deposition 
kinetics with automated systems. For example, 
one commercially available robot uses a rotating 
slide holder to speed up the assembly process 
(26). This rotation allows for a 3- to 10-fold re- 
duction in adsorption times and allows for thicker 
films to be prepared using higher rotation speeds. 
Roll-to-roll assembly also allows for layering to 
be performed faster (by 5- to 10-fold), through 
the use of flexible substrates (9). The immersion 
time and speed of the rolling process play a large 
role in determining the film properties, and the 
drying conditions, wettability, and substrate move- 
ment speed require optimization to produce films 
with similar properties to standard immersive 
assembly (9). A further improvement to roll-to- 
roll assembly uses a nip-roll technique to pre- 
vent excess solution from cross-contaminating 
the system, resulting in more homogenous coat- 
ings than immersive assembly (34). 

Immersive assembly can be performed on par- 
ticulate substrates that are too small to sediment 
quickly or physically move between solutions, 
such as micro- and nanoparticles. The most com- 
mon technology for immersive assembly on par- 
ticulate substrates is performed by adding polymer 
solution to dispersed dense particulate substrates, 
pelleting the particles with centrifugation, remov- 
ing the supernatant, washing multiple times with 
a similar pelleting process, and then repeating 
the steps for multilayer growth (5, 6, 10). This 
is generally time-consuming and labor-intensive 
due to the centrifugation steps, and particles 
dense and large enough to be pelleted are 
required. However, by using particulate sub- 
strates lighter than water (e.g., emulsions), 
creaming and skimming cycles can be applied 
for washing steps (17), although centrifugation 
can also be used to speed up the flotation and 
creaming process (35), with lighter emulsions 
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the surfactants used for emulsion stabilization 
(36). 

The major driving force behind the develop- 
ment of immersive assembly technologies for 


capable of creaming in a matter of minutes 
rather than hours (36). The use of emulsions 
as templates results in thicker films compared 
with using solid templates, probably due to 
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Fig. 1. Versatility of layer-by-layer assembly. (A) Schematic overview of LbL assembly and (B) an 
overview showing that the assembly technology influences film and process properties, as well as 
application areas. [Illustration credit: Alison E. Burke and Cassio Lynm] 
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particulate substrates is the attempt to avoid 
centrifugation, as it can lead to aggregation, is 
labor intensive, and is generally difficult to auto- 
mate. A simple way to avoid centrifugation is to 
remove the need for washing steps. This can be 
achieved by adding exact amounts of polymer 


A immersive 
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B Spin 
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calculated to saturate the surface of the partic- 
ulate substrates (12, 37), rather than the high 
concentrations of excess polymer solution gen- 
erally used (5, 6, 10). Initially, only two to three 
layers could be deposited before the particles 
start to aggregate (1/2), but more layers can be 
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Fig. 2. Layer-by-layer assembly technologies. (A to E) Schematics of the five major technology 
categories for LbL assembly. [Illustration credit: Alison E. Burke and Cassio Lynm] 
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deposited by incrementally measuring the zeta 
potential during assembly (37). Additionally, 
the use of constant mixing for soft particulate 
substrates such as emulsions (38) or sonica- 
tion during layer deposition for hard particu- 
late substrates like drug crystals (39) reduces 
aggregation. By optimizing the protocol, the 
saturation method gives a similar shell thick- 
ness to centrifugation-based assembly but is 
about three times faster (37, 39). This technol- 
ogy requires constant monitoring and surface 
area calculations to avoid adding excess polymer 
and therefore does not reduce manual involve- 
ment. A technology that focuses on decreasing 
manual involvement and reducing the need for 
centrifugation uses particulate substrates immo- 
bilized in agarose to convert collections of par- 
ticulate substrates into a macroscopic substrate 
(33). This macroscopic collection of immobilized 
particles can be treated like a planar substrate 
and immersed in polymer solutions using a ro- 
botic dipping machine, allowing for full auto- 
mation during the layering process. Although 
this technology generates films roughly half 
the thickness of those prepared by conventional 
centrifugation-based assembly, probably due to 
the impeded diffusion of polymers through the 
agarose hydrogel, ~80% of the particles can be 
recovered, which is an improvement over the 
~90% loss that has been reported for centrif- 
ugation-based assembly at high layer numbers 
(27,33). 

Due to the ease of use and versatility of mate- 
rial and template choice, immersive assembly 
has been applied for numerous applications. For 
example, light-emitting diodes (LEDs) can be 
prepared from immersive assembly on planar 
substrates, with the polymer choice and multi- 
layer thickness giving control over luminance 
and the turn-on voltage (29). Automated roll-to- 
roll immersive assembly can be used for deposit- 
ing conductive and flame-retardant coatings (34). 
Planar substrates coated with particle multi- 
layers can be used for the detection of small par- 
ticles invisible to the naked eye through color 
shifts in the multilayer films (2). Glass slides can 
also be coated with particle multilayers for the 
preparation of antireflective, antifogging, and 
self-cleaning surfaces (24). Fusion microreactors 
coated with particles are more conducive toward 
reaction (27). Certain particulate substrates easily 
allow for the removal of the template particle, 
leaving behind hollow multilayer capsules. Sim- 
ilarly, drugs themselves can be used as the par- 
ticulate templates, with both types suitable for 
drug delivery, (5, 6, 33, 38-40). 

In summary, immersive assembly is the most 
commonly used LbL assembly technology and 
the de facto standard against which other tech- 
nologies are compared. The simplicity of immers- 
ing substrates of almost any shape or size into 
containers with layering solution makes this 
technology easily accessible. The films produced 
have an interpenetrated structure and form 
“fuzzy” nanoassemblies that are almost synon- 
ymous with LbL assembly (3). Much recent work 
has been focused around shorter assembly times 
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and automated systems with less manual inter- 
vention. For coating particulate substrates, there 
has also been considerable interest in technol- 
ogies applicable to coating smaller, low-density 
particles (such as silica nanoparticles), which 
can be difficult to handle with the conventional 
centrifugation-based assembly. As immersive as- 
sembly typically requires more material than 
other technologies, especially to submerge large 
substrates on industrial scales, waste can be an 


issue, although solutions can be reused as long as 
cross-contamination remains low. Immersive as- 
sembly has been the workhorse of LbL assembly 
and will undoubtedly continue to play an integral 
part in the development of new and improved 
thin films. 


Spin assembly 


Layer-by-layer assembly using spin coating (i.e., 
“spin assembly”) utilizes the common coating 


technology of spinning a substrate to facilitate 
the deposition of materials (74). Although drying 
a substrate after immersive LbL assembly can 
be achieved through spinning (41), the majority 
of spin assembly is performed by either casting 
the solution onto a spinning substrate (42) or 
casting the solution onto a stationary substrate 
that is then spun (43). Spinning quickens the 
assembly process considerably, allowing for lay- 
ers to be deposited in ~30 s due to the various 


Table 1. Selected LbL assembly technologies and properties arising from using each technology. The table is intended to provide a general overview and 
is not exhaustive. NR indicates not reported in selected references. [Illustration credit: Alison E. Burke and Cassio Lynm] 
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LbL ASSEMBLY Substrates Substrate Layer materials Time per Automated Layer Roughness, Layer 
TECHNOLOGY sizes* layer (current status) thickness, nm? nm structure 
Immersive 10 nm-1 m 10s-12h <1-15 1-20 Interpenetrated 
7 Dipping (2, 25) Planar 1-100 mm Polymers, colloids 10-30s (32) or Yes 1-2 (25, 26) 1-10 (44, 49) 
LZ =~ 10-20 mint (26) 
| Dewetting (8) Planar 1-10 mm Polymers, colloids§ 30-60 s (8) No 1-2 (8) NR 
S— Roll-to-roll (9) Flexible planar 100 mm-1 m Polymers 2-5 min (9, 34) Yes 1-15 9, 34) 15-20 (9, 34) 
Centrifugation (5, 6) Particulate 10nm-10 yum Polymers, colloids 20+ minl| (5, 6) No 1-2 (6, 10) 3-10 (23, 94) 
Calculated saturation Particulate 100 nm-1 um Charged polymers 5-10 min (39) No 1-2 (39) NR 
(12, 37) 
Immersive immobilization Particulate 100 nm-1 pm Polymers 40-50 min (33) Yes <1 (33) NR 
(33) 
Creaming (11) Emulsion 10 nm-1 ym Polymers, colloids 0.5-12h (11, 36) No 1-7 (36, 38) NR 
Spin 1-100 mm 10 s-5 min <1-2 1-10 Stratified 
Pp =e Spin (14, 42, 43) Planar 1-100 mm Polymers, colloids 10-60 s (43) Yes <1-2 (42) 1-10 (44, 46) 
SSH ich gravity (15) Planar 1-10 mm Polymers, colloids 20s-5 min (15, 49) No NR 1-2 (49) 
Spray 10 nm-10 m <1s-24h <1-15 1-10 Stratified 
\ Spray (16, 52) Planar 1mm-10m Polymers <1-30 s (59) Yes <1-5 (16, 51) 1-10 (57) 
= Atomization (17) None 10-100 nm Charged polymers 12-24h(17) No 5-15 (17) NR 
4 Spray immobilization (63) Particulate 10-100 nm Polymers 5-10 s (63) Yes 2-4 (63) NR 
Electromagnetic 10 nm-100 mm 1s-20 min 1-20,000 10-30 Stratified 
= Electrodeposition (18, 20, Planar 1-100 mm Polymers, colloids 1s-20min(65, 71) No 2-20,0001 10-30 (66, 70) 
’ 67, 73) (20, 67, 71) 
Magnetic (79, 76) Planar and 10nm-100 mm _ Polymers, colloids 15-20 min (19, 76) No 1-2 (19, 75) NR 
se Particulate 
Electro-immobilization (13) Particulate 10nm-1 pm Charged polymers 15-20 min (13) No 2-3 (13) NR 
Fluidic 100 nm-100 mm 10 s-45 min <1-3 1-11 NR 
Soe _ Microfluidic planar (22) Planar 10 um-100 mm = Polymers 1-15 min (80, 87) Yes <1-3 (83, 84) 1-10 (80, 84) 
Microfluidic particulate Particulate 100nm-10um Polymers 10-60 s (96-98) Yes 1-3 (96,97) NR 
(96-98) 
Fluidized bed (23) Particulate 1-10 pm Polymers 3-5 min (23) No 2-3 (23) 9-11 (23) 
Fluidic immobilization (86, Particulate 100nm-1 ym Polymers, colloids 5-45 min (86, 87) No 1-2 (87) NR 
87) 
Vacuum/filtration (27) Particulate 100 nm-1 ym Polymers 10-20 min (95) Yes 1-2 (94) 5-10 (94) 
and fragile” 


*Typical order of magnitude substrate sizes are indicated. Larger or smaller substrate sizes are possible. 


tTypical thicknesses per layer for linearly growing 


films are indicated. Per-layer thicknesses for exponentially growing films vary widely, given the nonlinear growth profile. +Time with or without agitation, 
respectively. §Dewetting can make use of materials that are usually difficult to layer (e.g., materials with low charge or with low surface 
contact). ||Centrifugation processing time is highly variable due to manual pipetting and resuspension steps. Thickness is dependent on time. **Fragile 
substrates, such as mammalian cells, can be layered using fluidic filtration. 
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forces governing the process (43). Furthermore, 
spin assembly allows for automation and the coat- 
ing of substrates up to 10 cm in diameter using 
commercially available spin coaters (44, 45). How- 
ever, standard spin coaters are generally designed 
for flat surfaces and are not amenable to the com- 
plex shapes accessible to immersive assembly. 
Spin assembly typically results in more homoge- 
nous films compared with immersive assembly. 
This is because assembly is driven by a collection 
of forces including electrostatic interactions, which 
cause the adsorption and rearrangement of poly- 
mers, and centrifugal, air shear, and viscous forces, 
which cause desorption of weakly bound poly- 
mers and dehydration of the films (43). These 
forces are also the reason why spin assembly 
can be orders of magnitude faster than immersive 
assembly. The salt concentration of the polymer 
solution has a larger effect at higher spin speeds, 
meaning that electrostatic forces play a greater 
role at low ionic strength, and shear forces domi- 
nate at high ionic strength (46). These shear 


forces produce thinner, highly ordered films with 
specific layer interfaces when compared with im- 
mersive assembly, which produces thicker inter- 
penetrated films (47). Specifically, the thickness 
for spin-assembled polymer films is generally 
linked to the spin speed, with higher speeds lead- 
ing to thinner films (42). When depositing col- 
loids, the forces experienced during spinning 
lead to a monolayer of colloids, whereas standard 
immersive assembly often leads to a pseudo- 
monolayer in which the substrate is not fully 
coated (43, 48). A comparison study of the dif- 
ferences between automated immersive assem- 
bly and automated spin assembly found that 
immersive-assembly prepared thicker, rougher 
films, whereas spinning resulted in thinner, 
smoother films (44). The films differed visu- 
ally, as the spin-assembled films were transpar- 
ent because of their distinct-layer stratification, 
and the immersive-assembled films were opaque 
due to their inhomogeneous, interpenetrated lay- 
ers (Fig. 3). The contact angle and the relative 


A Creating different nanofilm properties using immersive vs spin LbL assembly 
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Fig. 3. Comparison between immersive and spin assembly. (A) Schematic comparison with layers 
of different materials. (B) Comparison between an immersive-assembled film (left) and a spin- 
assembled film (right). Films are made of hydrophobically modified poly(ethylene oxide) and poly(acrylic 
acid). [Adapted with permission from (44). Copyright 2008 American Chemical Society.] [Illustration 
credit: Alison E. Burke and Cassio Lynm] 
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concentration of polymers were consistent across 
all bilayers for spin-assembled films, whereas 
immersive-assembled films became rougher with 
time, giving varying contact angle and relative 
concentration ratios between the two constitu- 
ent layers (44). Another study, which compared 
spin assembly and immersive assembly, showed 
that clay nanocomposites in spin-assembled films 
have a higher degree of orientation (45). How- 
ever, one issue that can result from spin assem- 
bly, which is not a concern for other assembly 
technologies, is that at higher ionic strengths of 
polymer solution, and also at lower spin speeds, 
the films can be thicker where the solution was 
cast when compared with the edges of the sub- 
strate (42, 46). 

In a special case of spin assembly, the sub- 
strate can be placed in a closed container with a 
polymer solution or a colloidal dispersion paral- 
lel to the axis of rotation (rather than perpendicu- 
lar). Upon spinning, centrifugal force pushes the 
layer material directly onto the substrate rather 
than across the substrate, hence the name “high- 
gravity assembly” (15). This allows for improved 
film deposition and uniformity, especially at low 
polymer concentrations, because the rotation 
and increased turbulence lower the thicknesses 
of both the laminar layer and the diffusing layer 
around the substrate. The adsorption equilibrium 
can be reached at least five times faster than im- 
mersive assembly and is controllable by the spin 
speed. Furthermore, polymer combinations that 
grow exponentially using immersive assembly also 
grow linearly using this technology. Similarly, 
the roughness is much lower (~2- to 10-fold) for 
LbL films assembled in this way (49). 

Spin assembly typically produces substantially 
more organized films and multilayers than im- 
mersive assembly, which has made it a useful 
tool in preparing optical coatings with controlla- 
ble and homogenous color (/4) and for preparing 
transparent films (44). Similarly, spin assembly is 
useful for preparing LEDs with higher luminance 
than immersive assembly (47). A primary limita- 
tion for spin assembly in terms of application is 
that it is limited to coating small planar sub- 
strates, as increasing the substrate size requires 
higher spin speeds. Furthermore, spin coating 
of nonplanar surfaces is complicated. 

In summary, spin assembly uses rotating sub- 
strates to deposit layers and remove excess coat- 
ing material. Spin assembly typically produces 
thinner, more organized, and more stratified 
multilayers than does immersive assembly, and 
the process can be much faster. The spin coater 
needed for assembly is commonly accessible in 
many research environments and even some in- 
dustrial settings, such as with the robotic wafer 
processing common in the semiconductor indus- 
try, which could facilitate translation from the 
laboratory to real-world applications. Further- 
more, depositing multilayer films on nonflat 
surfaces, or even flat but rough surfaces, can be 
challenging due to the shear forces involved with 
film assembly. Nevertheless, the film and process 
properties arising from spin assembly, includ- 
ing smooth films assembled in a relatively short 
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time, continue to make this method an attract- 
ive choice. 


Spray assembly 


Spray LbL assembly is another assembly cat- 
egory, where films are assembled by aerosoliz- 
ing polymer solutions and sequentially spraying 
them onto substrates (76). Although spraying air 
has been used to dry films during LbL assembly 
to reduce contamination (and align carbon nano- 
tubes) (50), here we discuss spray assembly solely 
in the context of layer deposition. Standard spray 
assembly is much faster (as quick as ~6 s per layer) 
than immersive assembly (57) and approaches 
an industrial level far surpassing that of spin 
assembly (52, 53). Vacuum can also be used to 
further speed up the process by minimizing the 
lag time between spraying and washing, and 
vice versa, and to facilitate the spray coating 
of three-dimensional (3D) objects like mem- 
branes (54). 

In spray assembly, the film properties—such 
as the morphology, uniformity, chemical compo- 
sition, and selective membrane properties—can 
be tailored to be similar to those prepared by im- 
mersive assembly, with the film thickness influ- 
enced by suspension concentration, spray flow 
rate, spray duration, resting duration, whether 
or not the substrate is washed and for how long, 
and whether the solution is sprayed vertically or 
horizontally (16, 51, 55-57). This control arises 
from the two main forces governing the spray 
assembly process, namely, bulk movement in the 
actual spray and random movement in the liquid 
film (56). The random movement in the liquid 
film allows for polymer rearrangement and gen- 
erates much higher convection close to the sub- 
strate, allowing for improved deposition. This is 
because of the submicron thickness of the liquid 
film at the substrate interface and because of the 
speed at which the spray contacts the substrate 
(16, 56). Washing the substrate generally produces 
thicker films than leaving the substrate unwashed, 
due to polymer rearrangements during washing 
(SD). Like spin assembly, the films resulting from 
spray assembly have more distinct layers in com- 
parison to immersive assembly (J6, 53). 

Spray assembly has also been combined with 
other technologies to leverage technology-specific 
advantages and automate the assembly process. 
For example, a disadvantage of spray LbL assem- 
bly is that the obtained films may not be homo- 
geneous due to the effects of gravity draining, 
causing increased deposition in the vicinity of the 
solution drips, and because of irregular patterns 
caused by the spray nozzles at certain distances 
(51, 57). To address this problem, rotating the 
substrate during spray assembly allows for the 
preparation of more homogeneous films and 
subsecond spray times for each layer (56, 58, 59). 
By spraying rotating substrates, a majority of 
the polymer added to the substrate is adsorbed. 
In comparison, the vast majority of polymer re- 
mains in the coating solutions after immersive 
assembly. Therefore, applicable concentrations 
roughly 10 to 50 times less than those required 
for immersive assembly can be used for spray 
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assembly on rotating substrates (55, 59). Larger 
3D substrates, such as tubular membranes, can 
also be coated by rotating the substrate during 
spraying (60). A further improvement has been 
the computer-aided automation of spray assem- 
bly to reduce manual processing (58, 60). Similar 
to the use of automated immersive assembly on 
QCM chips, the use of QCM chips for automated 
spray assembly enables feedback loop control 
and tracking of real-time film growth (67). Auto- 
mated spray assembly has also been combined 
with roll-to-roll processing for coating industrial- 
size substrates (i.e., substrates that are tens of 
meters long) (62). Roll-to-roll spray assembly can 
also be used to coat particulate substrates with 
multilayer films by performing spray assembly 
on particulate substrates immobilized on top of 
a dissolvable surface (63). 

A stand-alone spray assembly technology for 
coating particulate substrates uses surface acous- 
tic waves of 1 to 10 nm in amplitude to atomize 
polymers and cargo (/7). As the atomized droplets 
move through the air, the solvent evaporates and 
the polymer condenses into particle form, result- 
ing in the first atomized solution becoming the 
template for subsequent coatings, with ~1000 car- 
riers produced from each microliter of solution. 
The particles are dialyzed to remove excess poly- 
mer, added to a solution of oppositely charged 
polymer, and then re-atomized to coat the par- 
ticles. This process can be repeated for multi- 
layer assembly; however, the dialysis process 
increases the processing time of this technology 
to ~24 hours for each layer. 

Spray assembly has found use for a wide vari- 
ety of applications because it can be used to coat 
industrial-scale substrates with relative ease (62) 
and is not limited to planar substrates (54, 60, 63). 
Spray assembly has been used to prepare flame- 
retardant films over cotton cloth, where it was 
shown that spraying on vertically oriented sub- 
strates produced superior flame-retardant films 
compared with both spraying on horizontally 
oriented substrates or dipping (57). Clothing ma- 
terial was also coated with spray assembly to 
control air flux and provide chemical protec- 
tion, potentially for use with military uniforms 
(54). Like other assembly technologies, spray as- 
sembly has been used to prepare antireflective 
coatings (67), and similarly, car tinting with struc- 
tured coloring to reduce heating from infrared 
light (62). Membrane tubes could also be coated 
to improve the separation of organic dyes from 
water (60). Because the structure of the films can 
be controlled at the nanometer-level by the spray 
time, spray assembly can be used to control con- 
ductance in thin films in ways that are not avail- 
able to other assembly technologies (59). Spray 
assembly has been used to prepare particles to 
examine cellular uptake of different coatings 
and aspect ratios of particles (63) and for gene 
delivery in vitro (17). Spray assembly has found 
use in diverse applications and industry because 
it offers rapid assembly times and is amenable 
to both automation and scale-up. 

In summary, spray assembly produces multi- 
layer films by aerosolizing coating solutions and 


spraying them onto the substrate. The resulting 
films are typically well organized with distinct 
layers. Spray assembly is a quick and easy meth- 
od to coat large or nonplanar substrates, although 
immersive assembly remains the method of choice 
for coating complex 3D substrates. Spray assem- 
bly is one of the most highly relevant technol- 
ogies for industrial applications, as it is already 
widely used in industry. 


Electromagnetic assembly 


Electromagnetic assembly is based on the use of 
an applied electric or magnetic field to effect lay- 
ering, such as by coating electrodes in polymer 
solutions or by moving magnetic particulate sub- 
strates in and out of coating solutions (/8, 19). The 
former, commonly referred to as electrodeposi- 
tion, is a well-established technology for coating 
materials using an applied voltage in electrolytic 
cells. In the standard electrodeposition setup, 
two electrodes are immersed in polymer solution, 
then an electric current is applied. The electrodes 
are then washed and placed into solution of an 
oppositely charged polymer; the polarities of the 
electrodes are reversed, and the process is re- 
peated (64). Electrodeposition can be used to 
rapidly assemble ions, polymers, and colloids in 
much less time than in immersive assembly (78). 
For example, bimetallic mesoporous LbL films 
can be prepared by electrodeposition, with the 
electrodeposition time determining the layer 
thickness at ~1.5 nm/s (65). In another setup, the 
substrate can be placed between the two elec- 
trodes, allowing for planar substrates to be coated 
(66), or even immobilized particles (13). This tech- 
nology results in films roughly twice the thickness 
of those resulting from centrifugation-based as- 
sembly. Electrodeposition can also use higher 
voltages, upwards of 30 V (13, 66); however, the 
assembly process for immobilized particles can 
take as long as 15 min per layer (13). 

The thicknesses of the electrodeposited films 
are directly related to the voltage used during 
assembly, with the optimum voltage for achiev- 
ing the thickest films dependent on the pH of 
the polymer solution (67). Higher voltages can 
cause desorption of the film as the electrode 
(i.e., the substrate) begins to repel the previously 
deposited layer. Generally, pH values lower than 
the pK, (where K, is the acid dissociation con- 
stant) of the polymers need lower voltages to 
reach peak thickness, and that peak thickness is 
also larger than the peak thickness at higher pH, 
closer to or above the pK, of the polymers. How- 
ever, if the voltage is raised high enough, a sec- 
ondary peak thickness can be reached, allowing 
for the assembly of films at pH values otherwise 
difficult to grow using other technologies (67). 
The reason for this “valley” in thickness is that 
at high voltages, the electrolysis of water at the 
electrode plays a bigger role in hindering poly- 
mer adsorption; however, at even higher volt- 
ages (>3 V) the electrostatic interaction between 
the polymer and electrode exceeds any hin- 
drance due to electrolysis (67). For example, 
polymer-enzyme films are roughly twice as 
thick when assembled at an optimal voltage of 
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1.2 V, compared with using lower or higher vol- 
tages (below 3 V) (68). These studies show that 
electrodeposition is similar to other LbL assemb- 
ly technologies in the sense that the pH of the 
polymer solution, and therefore the configura- 
tion of the polymer itself, is crucial in controlling 
film thickness. 

At higher voltages, electromagnetically as- 
sembled films are more interpenetrated than 
immersive-assembled films, which is in contrast 
to the highly stratified films prepared at lower 
voltages (66). When forming polymer-polymer 
films, the refractive index does not change sub- 
stantially during film growth in a flow cell under 
an electric current, suggesting a more homoge- 
nous deposition than in immersive assembly 
(69). Similarly, polymer-colloid films show high 
organization, as the refractive index decreases 
and transmittance increases when assembling 
films under higher voltages (70). Correspond- 
ingly, electrodeposited enzyme-polymer films are 
more uniform than immersive-assembled films, 
with 90% coverage of the substrate versus ~50% 
coverage, respectively (68). Because of this stratifi- 
cation and high surface coverage, electrodeposi- 
tion allows for control over the spacing between 
layers (68). 

Electrodeposition can also be achieved by 
using local effects at the electrodes, such as in- 
ducing redox reactions or changes in pH. The 
DH of the solution near the anode and cathode 
changes markedly from bulk solution to lower 
and higher pH values, respectively (71). The low 
DH near the anode can induce polymer deposition. 
However, this pH-induced electrodeposition is 
fairly limited, as only a few bilayers can be de- 
posited (using materials such as alginate and 
chitosan) because the layers become too thick 
(tens of micrometers) for the electric current to 
penetrate, resulting in no pH change and there- 
fore no deposition. Using a similar principle, 
covalently stabilized films can be prepared by 
generating copper(I) from copper(II) in situ 
at the electrode (i.e., the substrate) for cross- 
linking azide- and alkyne-containing polymers 
with copper-catalyzed “click” reactions (20). 
Polymers containing electrically sensitive click 
groups can also be electrocoupled, allowing for 
500-nm-thick transparent, and therefore strati- 
fied and homogenous, films to be prepared in 
~30 min (72). One-pot synthesis can be per- 
formed using the same basic principles by 
switching between oxidative and reductive reac- 
tions by alternating the voltage, allowing for wash- 
free assembly using electropolymerization (73). 

Magnets, rather than electric currents, can be 
used to assemble LbL films on sensitive partic- 
ulate templates, such as emulsions (74), or small 
templates difficult to pellet through centrifuga- 
tion, such as sub-10-nm iron oxide nanoparticles 
(19). Template particles containing magnetic nano- 
particles can be separated from the polymer 
solution using a magnet, which, similar to the 
filtration method, allows for nearly 100% of 
the particles to be recovered in a centrifugation- 
free LbL assembly process (75). Magnets or an 
external field can also be used to orient layered 
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magnetic nanoparticles on a planar substrate so 
that a subsequent layer of nanoparticles can de- 
posit more rapidly and in an oriented fashion 
(76). This technology uses standard immersive 
assembly for the deposition of positively and 
negatively charged magnetic nanoparticles with 
application of a magnetic field between depo- 
sition steps. Therefore, the thickness does not 
increase in relation to standard immersive as- 
sembly; however, the absorbance of the film in- 
creases with application of the magnet, suggesting 
increased packing (76). 

Electromagnetic assembly has found use in 
several different applications, as it can be used to 
form LbL films with compositions that are not 
readily assembled using other technologies. Bi- 
metallic films of Pt and Pd layers have Brunauer- 
Emmett-Teller surface areas of ~40 m? g¢? and 
therefore exhibit enhanced electrochemical activ- 
ity in the methanol oxidation reaction, compared 
with single-layer films (65). Antireflective coat- 
ings can be prepared by adjusting the refrac- 
tive index of the films by assembling the films at 
different voltages (70). Biological applications have 
also been explored, as biocompatible coatings 
can be formed using electromagnetic assembly, 
with in vitro tests confirming negligible cyto- 
toxicity (77). Bienzyme films with bioelectric cata- 
lytic properties have higher surface coverage, and 
therefore activity, when compared with tradi- 
tionally prepared films (68). The stratification of 
the assembled films is also conducive toward 
high-performance photoelectric devices (72) and 
separation membranes (66). Hollow polymer 
capsules (from micrometers to sub-100 nm in 
diameter) can also be prepared using electro- 
deposition on immobilized particles (73). 

In summary, electromagnetic assembly uses 
electric or magnetic fields, typically in the form 
of electrodes in polymer solutions or magnetic 
particulate substrates, to deposit films. Electro- 
magnetic assembly can exploit current-induced 
changes in pH or redox-reactions to effect film 
assembly, thus using a driving force substantially 
different from that of the other main assembly 
categories. Generally, electromagnetic-assembled 
films are thicker and more densely packed than 
films prepared using other LbL assembly meth- 
ods (13, 68). Electromagnetic assembly is still not 
as common as some of the other technologies, 
and even though it requires special equipment 
and expertise, it does offer a different approach to 
multilayer film assembly (e.g., through magnetic 
handling of substrates and materials or through 
electrically induced assembly), thereby providing 
alternative opportunities for assembling films. 


Fluidic assembly 


Fluidic assembly can be used to deposit multi- 
layers with fluidic channels, both by coating the 
channel walls and by coating a substrate placed 
or immobilized in a fluidic channel (77). The gen- 
eral method involves using pressure or vacuum 
to sequentially move polymer and washing solu- 
tions through the channels, which can be fluidic 
components, such as tubing or capillaries, or 
designed microfluidic networks (78, 79). Flow- 


chamber-based QCM is a common fluidic assem- 
bly technology used for investigating thin-film 
properties and multilayer growth by providing 
crucial real-time information (22). Higher con- 
centrations of polymer solution typically yield 
thicker films (79), with the contact time rather 
than the flow rate as the crucial factor deter- 
mining the amount of adsorbed polymer under 
flow (80). 

Fluidic assembly is typically implemented using 
a pump, capillary forces, or spinning to trans- 
port the liquid through the channels, although 
pipetting and static incubation can also be used. 
However, fluidic assembly strongly resembles 
immersive assembly when polymer solutions 
are allowed to remain in static contact with the 
substrate for more than 10 min (8/7, 82). Polymer 
and washing solutions loaded into channels with 
a pump or vacuum can deposit ~1.5-nm-thick 
layers in 5 to 10 min (83). Capillary forces can 
also be used to pull polymer solutions through 
microfluidic channels by placing droplets of 
solution at fluidic inlets, followed by spinning 
the substrate to remove the solution, allowing for 
~1.2-nm-thick layers to be deposited in less than 
2 min (84). Fluidic layering based on capillary 
forces is easy to implement, as capillary action 
does not require external active components, 
but it is not suitable for larger volumes or when 
dynamic control over the flow rate is needed. 

Fluidic devices and perfusion chambers can 
also be used to achieve region-specific fluidic as- 
sembly or to perform fluidic assembly on more 
complicated 3D structures. For example, complex 
automated microfluidic devices can be used to as- 
semble hundreds of layers in parallel using cap- 
illary flow and vacuum to fill and empty multiple 
channels (85). This enables the high-throughput 
screening of film libraries using small quantities 
of materials, as only a droplet is needed to fill a 
single microchannel. Region-specific films can 
be coated on substrates by affixing a geometric 
chamber over the substrate and then flowing 
the solution through the chamber and over the 
substrate (80). Perfusion chambers can be used 
for fluidic layering on complex 3D substrates such 
as sensitive biological substrates (like arteries), 
which must remain constantly hydrated during 
layering (86). Similarly, perfusion chambers can 
be used to hold agarose that contains immobilized 
particles for fluidic assembly (87). This technology 
not only allows for the deposition of polymers but 
also for the deposition of larger cargo, such as 
gold nanoparticles or liposomes, and produces 
films with nearly identical thickness to those pre- 
pared by standard centrifugation-based assem- 
bly (87). 

Vacuum is typically used with other assembly 
technologies, such as spray assembly, or to re- 
move the solution from channels in fluidic as- 
sembly, but it can also be used to form multilayers 
in a macrofluidic-type assembly, especially on un- 
usual substrates like aerogels. Aerogels can be 
functionalized using vacuum assembly by pour- 
ing solutions of conducting polymers, biomole- 
cules, or carbon nanotubes from the top and 
applying vacuum to pull these solutions down 
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through the aerogel (88). Vacuum assembly 
can also be used to deposit materials, such as 
reduced-graphene oxide, that would otherwise 
pose a challenge to the creation of uniform multi- 
layers (89). For particulate substrates, vacuum as- 
sembly can be performed using separating filters, 
down to 200 nm in pore size, for centrifugation- 
free layering (27, 90). Vacuum is not applicable 
for all sensitive templates; however, for template 
particles including emulsions (9), cell islets (92), 
or calcium carbonate nanowires (93), a slight 
vacuum (~100 mbar) can facilitate the layering 
process (27). Less than 3% of the particles are lost 
during assembly using an optimized procedure, 
far less than with the calculated saturation- 
based method, where ~50% can be lost, or the 
centrifugation-based method, where more than 
80% can be lost at high layer numbers (21). This 
combined filter-and-vacuum assembly technol- 
ogy yields a layer thickness of ~1.3 nm and a 
surface roughness of ~5 to 10 nm, which are both 
similar to those prepared via centrifugation-based 
assembly (94). A filtration setup has also been 
automated for coating cell islets, using a feed- 
back loop for evacuating the fluid from the re- 
action chamber, thereby reducing the manual 
handling time by ~60% (95). 

Like vacuum assembly, fluidic assembly is not 
restricted to planar substrates and is a viable 
alternative for centrifugation-free assembly on 
particulate substrates (96-98). Many fluidic as- 
sembly approaches coat emulsions or liquid crys- 
tals, as these materials are well studied in the 
fluidics field. Generally, the coating and washing 
solutions are deflected past the flow of particles 
by using physical gaps smaller than the particles 
so that the flow can enter perpendicular (96) 
or parallel to the particle flow stream (97). For 
fluidic assembly in parallel flow systems, larger 
template particles (~50 um in diameter) are nec- 
essary, as they can be deflected in a zig-zag pat- 
tern using solid pillars at a ~45° angle to three 
parallel laminar flow streams: solution A, wash- 
ing, and solution B. This gives a layer thickness 
of ~2 to 3 nm (97). A similar technology can be 
used to coat 15-1m beads with avidin and biotin, 
where higher deflection angles have a high cor- 
relation to failure rates, with angles of 1° optimal 
for a failure rate of virtually zero (99). Instead of 
pillars, specific geometries can be used to catch 
emulsions for the fluidic assembly of lipid layers 
(98). For coating lipid particles with polymers, 
tangential flow filtration can be used as a type 
of expedited dialysis for removing excess poly- 
mer solution (100). To coat sufficiently large 
and/or dense particles, a setup based on fluid- 
ized beds can be used. This allows for assem- 
bly that is ~eight times faster when compared 
with centrifugation-based assembly and produces 
films twice as thick (23). In this instance, the 
force of the washing or polymer solution lifting 
the particles is balanced against the force of 
gravity sedimenting the particles, resulting in a 
fluidized bed where washing and polymer solu- 
tions can be pushed past the particles. A similar 
setup can be used to coat larger (>100-um) par- 
ticles in packed columns, although these beds do 
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not need to be fluidized due to the large particle 
size, and gravity rather than a vacuum or pump- 
driven fluidics can be used to pull the coating and 
washing solutions through the column (1). 

Numerous applications have been introduced 
during the process of developing technologies 
for fluidic assembly. Many applications, such 
as improved capillary electrophoresis, are real- 
ized inside capillaries (87). Fluidic assembly can 
also be used to engineer complex flow patterns, 
such as having flow in opposite directions in the 
same capillary, simply by changing the outer 
coating of the capillary walls and generating 
flow with an electric current (82). Fluidic assem- 
bly is not limited to planar substrates: For ex- 
ample, multilayer coatings can be prepared on 
aerogels, resulting in improved compressive 
strength, wet-state super elasticity, fluorescence, 
and mechano responsive resistance, while also 
creating high charge-storage capacity (88). Dam- 
aged aortic porcine arteries can be repaired ex 
vivo with fluidic assembly, to protect the artery 
against unwanted blood coagulation, as well as 
to facilitate healing (86). Similarly, catheter tubing 
can be coated with antifungal multilayers to re- 
duce fouling (78). Chromatography beads coated 
with multilayers of particles increase the surface 
area of the beads, thereby improving chromatog- 
raphy (J). Although fluidic assembly is typically 
performed on larger particles (tens or hundreds 
of micrometers in diameter), smaller particles 
(below ~5 um in diameter) can be coated and 
loaded with functional cargo for potential drug 
delivery applications by combining microflu- 
idics with immobilization (87). Fragile particu- 
late substrates like emulsions can also be coated 
with lipids, using fluidic assembly for the gen- 
eration of synthetic cells (98). Neuronal cells can 
be patterned with fluidic assembly (83), and cell 
islets can be coated to improve robustness, allow- 
ing for in vivo transplantation (92, 95). Fluidic 
assembly functions as a valuable tool for coating 
sensitive particulate substrates, like mammalian 
cells, that may be damaged using other technol- 
ogies, such as during handling in centrifugation- 
based assembly. 

In summary, fluidic assembly provides the 
means to assemble multilayers on surfaces not 
easily accessible to other methods (e.g., inside cap- 
illaries), provides new ways for region-specific 
patterning (e.g., by masking a surface with a 
fluidic channel), and increases the industrial 
capacity of multilayer assemblies (e.g., through 
parallelization of film assembly and decrease 
of reagent consumption). Although the special- 
ized equipment and expertise required to set up 
(micro)fluidic systems can complicate the use of 
fluidic assembly, these advantages make it at- 
tractive for many applications. 


Challenges both big and small 


Over the past two decades, LbL assembly has un- 
dergone an explosive growth in usable materials 
and substrates. When taken together with all of 
the different assembly technologies available, it 
becomes obvious why LbL assembly is prevalent 
across a broad spectrum of disciplines. Despite 


this extensive toolbox, relatively few multilayer 
films have had widespread impact outside of 
research environments. One focus for industrial 
applications is the identification of reliable, sca- 
lable, and resource-effective assembly processes, 
although this may require different approaches 
for macroscopic substrates and for microscopic 
particulate substrates. 

For macroscopic substrates, improved high- 
throughput assembly methods for conformal 
coatings will play a key role. Immersive and spray 
roll-to-roll assembly are industrially relevant but 
only readily applicable to flexible planar substrates; 
therefore, innovation is needed in systems that 
can be easily scaled for coating large or numerous 
3D macroscopic substrates. Similarly, reducing 
material waste during the coating process re- 
mains important, especially for valuable coating 
materials like biomolecules and custom polymers. 
Another challenge for films intended for in vivo 
biomedical application, such as drug delivery and 
tissue engineering, is ensuring sterility of the 
product. This is typically achieved through steri- 
lization (heat, ultraviolet light, chemical treat- 
ment, etc.) just before use, which can affect film 
properties and performance. Finally, increasing 
the reliability and reproducibility of the films— 
for example, by increasing automation and reduc- 
ing manual intervention—is crucial for extending 
knowledge about film properties and assembly 
technologies and also for applying the multi- 
layer films in real-life applications. 

Similar challenges exist for particulate sub- 
strates. One crucial difference is that several 
particulate assembly methods depend on cen- 
trifugation, which remains difficult to scale or com- 
bine with minimal-intervention high-throughput 
assembly. Furthermore, yield and size ranges 
need to be specified for the various technolo- 
gies, as these details are often not determined. 
Detailed film properties (such as layer interpen- 
etration, layer density, film stability or respon- 
siveness, and permeability) that have primarily 
been studied on planar substrates also need to 
be investigated so that further comparisons be- 
tween planar and particulate substrates can 
be drawn. Altogether, these challenges are not 
trivial and require focused efforts to overcome; 
they are also not unique to the field of LbL as- 
sembly. One way to address these challenges is 
to continue to be open and look for solutions in 
new and sometimes unexpected areas, both in 
neighboring and more distant fields; this has 
underpinned much of the technological innova- 
tion in LbL assembly. 


Opportunities: Thinking outside the box 


Layer-by-layer assembly is a firmly established 
technology and shows great promise in multi- 
ple, diverse fields. Much of the development up 
until now has focused on using new molecular 
driving forces for film assembly, thus enabling the 
use of a suite of substrates and layer materials. 
However, this enormous potential still remains 
largely limited to small-scale research settings 
and requires technological and methodological 
innovation. Despite a surge of new technologies, 
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many being recent developments, unmet chal- 
lenges still remain, both for harnessing the 
specific strengths of different technologies for 
particular applications and also for develop- 
ing new and improved technologies. 

Although much work has been undertaken on 
establishing new assembly technologies, only a 
few studies have chosen a specific assembly tech- 
nology for the material properties generated (e.g., 
stratification, density, roughness) rather than 
the processing properties used (e.g., ease-of-use, 
material and time savings, lowered involvement, 
larger batches). For example, for applications 
where electrical conductivity is important (such 
as fuel cells and batteries), the conductivity of 
an immersive-assembled film can be superior 
to that of spray-assembled films, which can be 
explained by differences in the interlayer orga- 
nization of the constituent conductive layer ma- 
terials (interpenetration versus stratification) (59). 
Conversely, for applications where optical clar- 
ity and/or wetting behavior is important, spin- 
assembly can allow for an optically transparent 
film with well-controlled water-contact angles 
to be assembled due to the smooth, stratified 
layers formed, whereas an immersive-assembled 
film can be translucent and with a contact angle 
that drastically changes depending on the num- 
ber of layers deposited due to the rough, inter- 
penetrated layers formed (Fig. 3) (44). However, 
layer structure is only one of the critical film 
properties to be taken into account when design- 
ing films for specific applications. For example, 
the higher surface coverage and layer density 
associated with electromagnetic assembly can 
allow for electrodeposited enzyme films to have 
higher enzymatic activity than comparable 
immersive-assembled films (68). Of course, the 
layer structure and density are not relevant if 
the desired film components cannot be layered, 
which can be an issue, for example, when using 
materials with low charge density (e.g., reduced- 
graphene oxide) or with a low surface area of 
contact (e.g., branched nanowires). In such 
cases, technologies such as dewetting and 
vacuum assembly enable film formation using 
constituents that cannot be easily layered using 
other technologies (8, 89). These examples dem- 
onstrate how the judicious choice of assembly 
technology can enable the assembly of new and 
improved thin films. As our understanding of the 
different technologies and how they compare to 
each other increases, so does the opportunity to 
let this insight help guide the development of the 
next generation of LbL assembled thin films. 

It is noteworthy that the assembly technologies 
discussed herein were not originally developed 
for LbL assembly, and crossover technologies 
from other fields will continue to play an im- 
portant part for new, and perhaps even revolu- 
tionary, developments. One interesting example 
involving industrial-scale layering was performed 
using a modified car wash for spray assembly on 
a full-sized car (101). Technologies long used in 
the pharmaceutical industry, such as methods 
used to treat, purify, and concentrate pharma- 
ceuticals, may prove transformative for biomed- 
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ical applications. Similarly, using everyday objects 
like spray-paint cans could revolutionize assem- 
bly methods by essentially combining dewetting 
and spray assembly for rapid region-specific as- 
sembly with little to no material waste and no 
washing steps. Other combinations between ex- 
isting assembly technologies should also help to 
expedite and automate the assembly process. 
Along these lines, technologies for assembly on 
particulate substrates are expected to continue 
to integrate immobilization methods, as they al- 
low collections of particles to be treated like pla- 
nar substrates, making accessible many of the 
planar assembly technologies discussed herein. 
Another promising approach for particulate sub- 
strates could be to use a type of “sponge” to ad- 
sorb excess polymer from solution, thus removing 
the need to pellet the particles. In terms of future 
developments for applications, it will be im- 
portant to understand the interaction between 
multilayer films and complex and natural en- 
vironments, such as those found in the human 
body (40), outdoors, or in seawater. An impor- 
tant aspect of this could be the use of functional 
substrates capable of compounding the benefits 
of different multilayers in a synergistic fashion. 
Overall, the future of LbL assembly is bright, 
and as the black box of assembly technologies is 
slowly illuminated, great potential for innova- 
tion and application will be found. 
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INTRODUCTION: The copy number and in- 
tracellular localization of RNA are important 
regulators of gene expression. Measurement 
of these properties at the transcriptome scale 
in single cells will give answers to many ques- 
tions related to gene expression and regulation. 
Single-molecule RNA imaging approaches, such 
as single-molecule fluorescence in situ hybrid- 
ization (smFISH), are powerful tools for count- 
ing and mapping RNA; however, the number 
of RNA species that can be simultaneously im- 
aged in individual cells has been limited. This 
makes it challenging to perform transcriptomic 
analysis of single cells in a spatially resolved 
manner. Here, we report multiplexed error- 
robust FISH (MERFISH), a single-molecule im- 
aging method that allows thousands of RNA 


Image 1 


Image 2 


ImageN 


species to be imaged in single cells by using 
combinatorial FISH labeling with encoding 
schemes capable of detecting and/or correct- 
ing errors. 


RATIONALE: We labeled each cellular RNA 
with a set of encoding probes, which contain 
targeting sequences that bind the RNA and 
readout sequences that bind fluorescently la- 
beled readout probes. Each RNA species is 
encoded with a particular combination of read- 
out sequences. We used successive rounds of 
hybridization and imaging, each with a differ- 
ent readout probe, to identify the readout se- 
quences bound to each RNA and to decode the 
RNA. In principle, combinatorial labeling al- 
lows the number of detectable RNA species to 
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MERFISH for transcriptome imaging. Numerous RNA species can be identified, counted, and 
localized in a single cell by using MERFISH, a single-molecule imaging approach that uses combi- 
natorial labeling and sequential imaging with encoding schemes capable of detection and/or 
correction of errors. This highly multiplexed measurement of individual RNAs can be used to 
compute the gene expression profile and noise, covariation in expression among different genes, 
and spatial distribution of RNAs within single cells. 
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grow exponentially with the number of imaging 
rounds, but the detection errors also increase 
exponentially. To combat such accumulating 
errors, we exploited error-robust encoding 
schemes used in digital electronics, such as 
the extended Hamming code, in the design of 


our encoding probes but 
modified these schemes 
Read the full article in order to account for the 
at http://dx.doi. error properties in FISH 
org/10.1126/ measurements. We assigned 
science.aaa6090 each RNA a binary word 
betwee Sante ass Syour modified Hameing 


code and encoded the RNA with a combina- 
tion of readout sequences according to this 
binary word. 


RESULTS: We first imaged 140 RNA species 
in human fibroblast cells using MERFISH 
with 16 rounds of hybridization and a mod- 
ified Hamming code capable of both error 
detection and correction. We obtained ~80% 
detection efficiency and observed excellent 
correlation of RNA copy numbers determined 
with MERFISH with both bulk RNA sequenc- 
ing data and conventional smFISH measure- 
ments of individual genes. 

Next, we used an alternative MERFISH en- 
coding scheme, which is capable of detecting 
but not correcting errors, to image 1001 RNA 
species in individual cells using only 14 rounds 
of hybridization. The observed RNA copy num- 
bers again correlate well with bulk sequencing 
data. However, the detection efficiency is only 
one-third that of the error-correcting encod- 
ing scheme. 

We performed correlation analysis of the 10* 
to 10° pairs of measured genes and identified 
many covarying gene groups that share com- 
mon regulatory elements. Such grouping allowed 
us to hypothesize potential functions of ~100 
unannotated or partially annotated genes of 
unknown functions. We further analyzed cor- 
relations in the spatial distributions of different 
RNA species and identified groups of RNAs 
with different distribution patterns in the cell. 


DISCUSSION: This highly multiplexed imag- 
ing approach enables analyses based on the 
variation and correlation of copy numbers and 
spatial distributions of a large number of RNA 
species within single cells. Such analyses should 
facilitate the delineation of regulatory networks 
and in situ identification of cell types. We en- 
vision that this approach will allow spatially 
resolved transcriptomes to be determined for 
single cells. m 
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Knowledge of the expression profile and spatial landscape of the transcriptome in 
individual cells is essential for understanding the rich repertoire of cellular behaviors. 
Here, we report multiplexed error-robust fluorescence in situ hybridization (MERFISH), a 
single-molecule imaging approach that allows the copy numbers and spatial localizations 
of thousands of RNA species to be determined in single cells. Using error-robust encoding 
schemes to combat single-molecule labeling and detection errors, we demonstrated the 
imaging of 100 to 1000 distinct RNA species in hundreds of individual cells. Correlation 
analysis of the ~10* to 10° pairs of genes allowed us to constrain gene regulatory 
networks, predict novel functions for many unannotated genes, and identify distinct spatial 
distribution patterns of RNAs that correlate with properties of the encoded proteins. 


ystem-wide analyses of the abundance and 

spatial organization of RNAs in single cells 

promise to transform our understanding 

in many areas of cell and developmental 

biology, such as the mechanisms of gene 
regulation, the heterogeneous behavior of cells, 
and the development and maintenance of cell 
fate (1). Single-molecule fluorescence in situ hy- 
bridization (SmFISH) has emerged as a powerful 
tool for studying the copy number and spatial 
organization of RNAs in single cells either in 
isolation or in their native tissue context (2, 3). 
Taking advantage of its ability to map the spatial 
distributions of specific RNAs with high resolu- 
tion, smFISH has revealed the importance of sub- 
cellular RNA localization in diverse processes such 
as cell migration, development, and polarization 
(4-8). In parallel, the ability of smFISH to pre- 
cisely measure the copy numbers of specific RNAs 
without amplification bias has allowed quantita- 
tive measurement of the natural fluctuations in 
gene expression, which has in turn elucidated the 
regulatory mechanisms that shape such fluctua- 
tions and their role in a variety of biological pro- 
cesses (9-13). 

Recent advances in imaging and analysis meth- 
ods have allowed hundreds of smFISH measure- 
ments to be performed in an automated manner, 
substantially expanding our knowledge of the 
RNA expression profile and spatial organization 
in different organisms (14, 15). However, applica- 
tion of the smFISH approach to many systems- 
level questions remains limited by the number of 
RNA species that can be simultaneously mea- 
sured in single cells. State-of-the-art efforts by 
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using combinatorial labeling with either color- 
based barcodes or sequential hybridization have 
enabled simultaneous measurements of 10 to 30 
different RNA species in individual cells (6-19), 
yet many interesting biological questions would 
benefit from the measurement of hundreds to 
thousands of RNAs within a single cell. For ex- 
ample, analysis of how the expression profile of 
such a large number of RNAs vary from cell to 
cell and how these variations correlate among 
different genes could be used to systematically 
identify coregulated genes and map regulatory 
networks, knowledge of the subcellular organi- 
zations of numerous RNAs and their correlations 
could help elucidate molecular mechanisms under- 
lying the establishment and maintenance of many 
local cellular structures, and RNA profiling of in- 
dividual cells in native tissues could allow in situ 
identification of cell type. 

Here, we report multiplexed error-robust FISH 
(MERFISH), a highly multiplexed smFISH imag- 
ing method that substantially increases the num- 
ber of RNA species that can be simultaneously 
imaged in single cells by using combinatorial la- 
beling and sequential imaging with error-robust 
encoding schemes. We demonstrated this tran- 
scriptome imaging approach by simultaneously 
measuring 140 RNA species with an encoding 
scheme that can both detect and correct errors 
and 1001 RNA species with an encoding scheme 
that can detect but not correct errors. Correlation 
analyses of the copy number variations and spa- 
tial distributions of these genes allowed us to 
identify groups of genes that are coregulated and 
groups of genes that share similar spatial distribu- 
tion patterns inside the cell. 


Combinatorial labeling with 
error-robust encoding schemes 


Combinatorial labeling that identifies each RNA 
species by multiple (V) distinct signals offers a 


route to rapidly increase the number of RNA spe- 
cies that can be probed simultaneously in indi- 
vidual cells (Fig. 1A). However, this approach to 
scaling up the throughput of smFISH to the sys- 
tems scale faces a substantial challenge because 
not only does the number of addressable RNA 
species increases exponentially with N, but the 
detection error rates also grow exponentially with 
N (Fig. 1, B to D). Imagine a conceptually simple 
scheme to implement combinatorial labeling, in 
which each RNA species is encoded with a N-bit 
binary word, and the sample is probed with NV 
corresponding rounds of hybridization, each round 
targeting only the subset of RNAs that should 
read “1” in the corresponding bit (fig. $1). NV 
rounds of hybridization would allow 2% - 1 RNA 
species to be probed. With just 16 hybridizations, 
more than 64,000 RNA species—which should 
cover the entire human transcriptome, including 
both messenger RNAs (mRNAs) and noncoding 
RNAs (20)—could be identified (Fig. 1B, black sym- 
bols). However, as NV increases, the fraction of RNAs 
properly detected (the calling rate) would rapidly 
decrease and, more troublingly, the fraction of 
RNAs that are identified as incorrect species (the 
misidentification rate) would rapidly increase (Fig. 
1, C and D, black symbols). With realistic error 
rates per hybridization (measured below), the ma- 
jority of RNA molecules would be misidentified 
after 16 rounds of hybridizations. 

To address this challenge, we designed error- 
robust encoding schemes in which only a sub- 
set of the 2” - 1 words separated by a certain 
Hamming distance (27) were used to encode RNAs. 
In a codebook in which the minimum Hamming 
distance is 4 (HD4 code), at least four bits must 
be read incorrectly to change one valid code word 
into another (fig. S2A). As a result, every single-bit 
error produces a word that is exclusively close to 
a single RNA-encoding word, allowing such er- 
rors to be detected and corrected (fig. S2B). 
Double-bit errors produce words with an equal 
Hamming distance of 2 from multiple valid code 
words and, thus, can be detected but not corrected 
(fig. S2C). Such a code should substantially increase 
the calling rate and reduce the misidentification 
rate (Fig. 1, C and D, blue symbols). To further 
account for the fact that it is more likely to miss a 
hybridization event (an 1-0 error) than to mis- 
identify a background spot as an RNA (an 0—1 
error) in smFISH measurements, we designed a 
modified HD4 (MHD4) code, in which the num- 
ber of 1 bits were kept both constant and rela- 
tively low—only four per word in this work—so 
as to reduce error and avoid biased detection. This 
MHD4 code should further increase the calling 
rate and reduce the misidentification rate (Fig. 1, 
C and D, purple symbols). 

In addition to the error considerations, several 
practical challenges have also made it difficult to 
probe a large number of RNA species, such as the 
high cost of the massive number of distinct FISH 
probes needed and the long time required to com- 
plete many rounds of hybridization. An oligopaint 
approach has been previously developed to generate 
a large number of oligonucleotide probes to label 
chromosome DNA and to introduce nontargeting 
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Fig. 1. MERFISH: A highly multiplexed smFISH approach enabled by combinatorial labeling and 
error-robust encoding. (A) Schematic depiction of the identification of multiple RNA species in N rounds 
of imaging. Each RNA species is encoded with a N-bit binary word, and during each round of imaging, only 
the subset of RNAs that should read 1 in the corresponding bit emit signal. (B to D) The number of 
addressable RNA species (B); the rate at which these RNAs are properly identified—the “calling rate” (C); 
and the rate at which RNAs are incorrectly identified as a different RNA species—the “misidentification 
rate” (D); plotted as a function of the number of bits (N) in the binary words encoding RNA. Black indicates 
a simple binary code that includes all 2-1 possible binary words. Blue indicates the HD4 code in which the 
Hamming distance separating words is 4. Purple indicates a modified HD4 (MHD4) code where the 
number of 1 bits are kept at four. The calling and misidentification rates are calculated with per-bit error 
rates of 10% for the 1+0 error and 4% for the O-1 error. (E) Schematic diagram of the implementation of a 
MHD4 code for RNA identification. Each RNA species is first labeled with ~192 encoding probes that 
convert the RNA into a specific combination of readout sequences (Encoding hyb). These encoding probes 
each contain a central RNA-targeting region flanked by two readout sequences, drawn from a pool of N 
different sequences, each associated with a specific hybridization round. Encoding probes for a specific 
RNA species contain a particular combination of four of the N readout sequences, which correspond to the 
four hybridization rounds in which this RNA should read 1. N subsequent rounds of hybridization with the 
fluorescent readout probes are used to probe the readout sequences (hyb 1, hyb 2, ..., hyb N). The bound 
probes are inactivated by photobleaching between successive rounds of hybridization. For clarity, only one 
possible pairing of the readout sequences is depicted for the encoding probes; however, all possible pairs 
of the four readout sequences are used at the same frequency and distributed randomly along each 
cellular RNA in the actual experiments. 


sites for secondary activities (22). Inspired by 
this approach, we designed a two-step labeling 
scheme to encode and read out cellular RNAs 
(Fig. 1E). First, we label cellular RNAs with a set 
of encoding probes, each probe comprising a 
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RNA targeting sequence and two flanking readout 
sequences. Four of the N distinct readout se- 
quences were assigned to each RNA species based 
on the N-bit MHD4 code word of the RNA. 
Second, we identified these N readout sequences 


with complementary FISH probes (the readout 
probes) via N rounds of hybridization and imag- 
ing, each round using a different readout probe. 
To increase the signal-to-background ratio, we 
labeled every cellular RNA with ~192 encoding 
probes. Because each encoding probe contained 
two of the four readout sequences associated with 
that RNA (Fig. 1E), a maximum of ~96 readout 
probes can bind to each cellular RNA per hybrid- 
ization round. To generate the massive number of 
encoding probes required, we amplified them 
from array-derived oligonucleotide pools contain- 
ing tens of thousands of custom sequences using 
a modified form of the oligopaint protocol com- 
prising in vitro transcription followed by reverse 
transcription (fig. S3 and supplementary mate- 
rials, materials and methods, “Probe Synthesis”) 
(22, 23). This two-step labeling approach dra- 
matically diminished the total hybridization time 
for an experiment; we found that efficient hybrid- 
ization to the readout sequences took only 15 min, 
whereas efficient direct hybridization to cellular 
RNA required more than 10 hours. 


Measuring 140 genes with MERFISH 
by use of a 16-bit MHD4 code 


To test the feasibility of this error-robust, multi- 
plexed imaging approach, we performed a 140-gene 
measurement on human fibroblast cells (IMR90) 
using a 16-bit MHD4 code to encode 130 RNA 
species while leaving 10 code words as misiden- 
tification controls (table S1). After each round of 
hybridization with the fluorescent readout probes, 
cells were imaged by means of conventional wide- 
field imaging with an oblique-incidence illumina- 
tion geometry. Fluorescent spots corresponding 
to individual RNAs were clearly detected and were 
then efficiently extinguished via a brief photo- 
bleaching step (Fig. 2A). The sample was stable 
throughout the 16 rounds of iterative labeling 
and imaging: The change in the number of fluo- 
rescent spots from round to round matched the 
expected change predicted on the basis of the rel- 
ative abundances of RNA species targeted in each 
round derived from bulk sequencing, and we did 
not observe a systematic decreasing trend with 
increasing number of hybridization rounds (fig. 
S4A). The average brightness of the spots varied 
from round to round with a standard deviation of 
40%, which is likely due to different binding ef- 
ficiencies of the readout probes to the different 
readout sequences on the encoding probes (fig. 
S4B). We observed only a small, systematic decreas- 
ing trend in the spot brightness with increasing 
hybridization rounds, which was on average 4% 
per round (fig. S4B). 

We then constructed binary words from the 
observed fluorescent spots based on their on-off 
patterns across the 16 hybridization rounds (Fig. 
2, B to D). If the word exactly matched one of the 
140 MHD4 code words (exact matches) or differed 
by only one bit (error-correctable matches), we 
assigned it to the corresponding RNA species 
(Fig. 2D). Within the single cell depicted in Fig. 2, 
A and B, more than 1500 RNA molecules corre- 
sponding to 87% of the 130 encoded RNA spe- 
cies were detected after error correction (Fig. 2E). 
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Fig. 2. Simultaneous measurement of 140 RNA species in single cells by use of MERFISH with a 
16-bit MHD4 code. (A) Images of RNA molecules in an IMR90 cell after each hybridization round (hyb 
1 to hyb 16). The images after photobleaching (for example, bleach 1) demonstrate efficient removal of 
fluorescent signals between hybridizations. (B) The localizations of all detected single molecules in this cell 
colored according to their measured binary words. (Inset) The composite, false-colored fluorescent image 
of the 16 hybridization rounds for the boxed subregion with numbered circles indicating potential RNA 
molecules. A red circle indicates an unidentifiable molecule, the binary word of which does not match any 
of the 16-bit MHD4 code words even after error correction. (©) Fluorescent images from each round of 
hybridization for the boxed subregion in (B), with circles indicating potential RNA molecules. (D) 
Corresponding words for the spots identified in (C). Red crosses represent the corrected bits. (E) The RNA 
copy number for each gene observed without (green) or with (blue) error correction in this cell. (F) The 
confidence ratio measured for the 130 RNA species (blue) and the 10 misidentification control words (red) 
normalized to the maximum value observed from the misidentification controls (dashed line). (G@) Scatter 
plot of the average copy number of each RNA species per cell measured with two shuffled codebooks of 
the MHD4 code. The Pearson correlation coefficient is 0.94 with a P value of 1 x 10°°°. The dashed line 
corresponds to the y = x line. (H) Scatter plot of the average copy number of each RNA species per cell 
versus the abundance determined by bulk sequencing in FPKM. The Pearson correlation coefficient 
between the logarithmic abundances of the two measurements was 0.89 with a P value of 3 x 107°°. 
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Similar observations were made in ~400 cells 
from seven independent experiments. On aver- 
age, ~4 times as many RNA molecules and ~2 
times as many RNA species were detected per 
cell after error correction as compared with the 
values obtained before error correction (fig. $5). 

Two types of errors can occur in the copy 
number measurement of each RNA species: (i) 
Some molecules of this RNA species are not de- 
tected, leading to a drop in calling rate, and (ii) 
some molecules from other RNA species are mis- 
identified as this RNA species. To assess the extent 
of misidentification, we used the 10 misidentifi- 
cation control words—code words that were not 
associated with any cellular RNA. Although matches 
to these control words were observed, they occurred 
far less frequently than did the real RNA-encoding 
words: 95% of the 130 RNA-encoding words were 
counted more frequently than the median count 
for these control words. Moreover, we typically 
found the ratio of the number of exact matches 
to the number of matches with one-bit errors for 
a real RNA-encoding word to be substantially 
higher than the same ratios observed for the mis- 
identification controls, as expected (fig. S6, A and 
B). Using this ratio as a measure of the confidence 
in RNA identification, we found that 91% of the 
130 RNA species had a confidence ratio greater 
than the maximum confidence ratio observed for 
the misidentification controls (Fig. 2F), demon- 
strating a high accuracy of RNA identification. 
Subsequent analyses were conducted only on 
these 91% of genes. 

To estimate the calling rate, we used the error- 
correction ability of the MHD4 code to determine 
the 1—0 error rates (10% on average) and 0—1 
error rates (4% on average) for each hybridiza- 
tion round (fig. S6, C and D). Using these error 
rates, we estimated an ~80% calling rate for in- 
dividual RNA species after error correction— 
~80% of the fluorescent spots corresponding to 
a RNA species were decoded correctly (fig. S6E). 
Although the remaining 20% of spots contrib- 
uted to a loss in detection efficiency, most of them 
did not cause species misidentification because 
they were decoded as double-bit error words and 
discarded. 

To test for potential technical bias in our mea- 
surements, we probed the same 130 RNAs spe- 
cies with a different MHD4 codebook by shuffling 
the code words among different RNA species 
(table S1) and changing the encoding probe se- 
quences. Measurements with this alternative code 
gave similar misidentification and calling rates 
(fig. S7). The copy numbers of individual RNA 
species per cell measured with these two code- 
books showed excellent agreement with a Pearson 
correlation coefficient of 0.94 (Fig. 2G), indicating 
that the choice of encoding scheme did not bias 
the measured counts. 

In order to validate the copy numbers derived 
from our MERFISH experiments, we performed 
conventional smFISH measurements on 15 of the 
130 genes, spanning the full measured abun- 
dance range of three orders of magnitude. For 
each of these genes, both the average copy num- 
ber and the copy number distribution across 
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many cells agreed quantitatively between our 
MERFISH and conventional smFISH measure- 
ments (fig. S8, A and B). The ratio of the copy 
numbers determined by these two approaches 
was 0.82 + 0.06 (mean + SEM across the 15 mea- 
sured RNA species) (fig. S8B), which agrees with 
the estimated 80% calling rate for our multiplexed 
imaging approach. The quantitative match be- 
tween this ratio and our estimated calling rate 
over the full measured abundance range addi- 
tionally supports our assessment that the misiden- 
tification error was low. Given that the agreement 
between the MERFISH and conventional smFISH 
results extended to the genes at the lowest mea- 
sured abundance (<1 copy per cell) (fig. S8B), we 
estimate that our measurement sensitivity was 
better than 1 copy per cell. 

As a final validation, we compared the abun- 
dance of each RNA species averaged over hun- 
dreds of cells to those obtained from a bulk RNA 
sequencing measurement that we performed on 
the same cell line. Our imaging results correlated 
remarkably well with bulk sequencing results, 
with a Pearson correlation coefficient of 0.89 
(Fig. 2H). 


High-throughput analysis of 
cell-to-cell variation in gene expression 


The MERFISH approach allows parallelization 
of measurements of many individual RNA spe- 
cies and covariation analysis between different 
RNA species. We first illustrated the paralleliza- 
tion aspect by examining the cell-to-cell variation 
in the expression level of each of the measured 
genes (Fig. 3A). To quantify the measured varia- 
tion, we computed the Fano factors, defined as 
the ratio of the variance to the mean RNA copy 
number, for all measured RNA species. The Fano 
factors substantially deviated from 1, the value 
expected for a simple Poisson process, for many 
genes and exhibited an increasing trend with the 
mean RNA abundance (Fig. 3B), which is consist- 
ent with a previous observation for other cell types 
(24). A simple model for promoter regulation—the 
promoter stochastically switches between on and 
off states with global constraints on the kinetic 
rates—has been previously suggested to ratio- 
nalize such a trend (24, 25). According to this mod- 
el, this trend of increasing Fano factors with mean 
RNA abundance can be explained by changes 
in the transcription rate and/or promoter off- 
switching rates but not by changes in the pro- 
moter on-switching rate. 

Moreover, we identified several RNA species 
with substantially larger Fano factors than this 
average trend. For example, we found that SLC5A3, 
CENPF, MKI67, TNC, and KIAAII99 displayed 
Fano factor values substantially higher than those 
of the other genes expressed at similar abundance 
levels. The high variability of some of these genes 
can be explained by their association with the 
cell cycle. For example, two of these particularly 
“noisy” genes, MKI67 and CENPF, are both an- 
notated as cell-cycle related genes (26), and based 
on their bimodal expression (Fig. 3C), we propose 
that their transcription is strongly regulated by 
the cell cycle. Other high-variability genes did not 
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show the same bimodal expression patterns and 
are not known to be associated with the cell cycle. 
Understanding the origin and implications of 
noisy gene expression is an active topic of current 
research (24). 


Analysis of expression covariation 
among different genes 


Analysis of covariations in the expression levels 
of different genes can reveal which genes are 
coregulated and elucidate gene regulatory path- 
ways. At the population level, such analysis often 
requires the application of external stimuli to drive 
gene expression variation; hence, correlated ex- 
pression changes can be observed among genes 
that share common regulatory elements influ- 
enced by the stimuli (27). At the single-cell level, 
one can take advantage of the natural stochastic 
fluctuations in gene expression for such analysis 
and can thus study multiple regulatory networks 
without having to stimulate each of them indi- 
vidually. Such covariation analysis can constrain 
regulatory networks, suggest new regulatory path- 
ways, and predict function for unannotated genes 
based on associations with covarying genes (JI, 28). 

We applied this approach to the 140-gene mea- 
surements and examined the ~10,000 pairwise 
correlation coefficients that describe how the 
expression levels of each pair of genes covaried 
from cell to cell. Many of the highly variable genes 
showed tightly correlated or anticorrelated varia- 
tions (Fig. 3C). To better understand the correla- 
tions for all gene pairs, we adopted a hierarchical 
clustering approach, commonly used in the analy- 
sis of both bulk and single-cell expression data 
(29, 30), to organize these genes on the basis of 
their correlation coefficients (Fig. 3D). From the 
cluster tree structure, we identified seven groups 
of genes with substantially correlated expression 
patterns (Fig. 3D and table S2). Within each of 
the seven groups, every gene showed significant- 
ly stronger average correlation with other mem- 
bers of the group than with genes outside the 
group (table S2). To further validate and under- 
stand these groups, we identified gene ontology 
(GO) terms (37) enriched in each of these seven 
groups. The enriched GO terms within each group 
shared similar functions and were largely specific 
to each group (Fig. 3E and table S2), validating 
the notion that the observed covariation in ex- 
pression reflects some commonalities in the regu- 
lation of these genes. 

Here, we describe two of these groups as il- 
lustrative examples. The predominant GO terms 
associated with group 1 were terms associated 
with the extracellular matrix (ECM) (Fig. 3, D and 
E, and table S2). Notable members of this group 
included ECM components—such as FBN1, FBN2, 
COLSA, COL7A, and TNC—and glycoproteins link- 
ing the ECM and cell membranes, such as VCAN 
and THBS1. The group also included an unanno- 
tated gene, KIAA1199, which we would predict to 
play a role in ECM metabolism on the basis of its 
association with this cluster. Indeed, this gene 
has recently been identified as an enzyme in- 
volved in the regulation of hyaluronan, which is 
a major sugar component of the ECM (32). 


Group 6 contained many genes that encode 
vesicle transport proteins and proteins associ- 
ated with cell motility (Fig. 3, D and E, and table 
S2). The vesicle transport genes included micro- 
tubule motors and related genes DYNCIH, CKAPI1, 
and factors associated with vesicle formation and 
trafficking, such as DNAJC13 and RAB3B. Again, 
we found an unannotated gene, KIAA1462, with- 
in this cluster. On the basis of its strong cor- 
relation with DYNCIH1 and DNAJC73, we predict 
that this gene may be involved in vesicle trans- 
port. The cell motility genes in this group included 
genes encoding actin-binding proteins such as 
AFAPI, SPTANI, SPTBN1, and MYH1O, and genes 
involved in the formation of adhesion complexes, 
such as FLNA and FLNC. Several guanosine 
triphosphatase (GTPase)-associated factors in- 
volved in the regulation of cell motility, attach- 
ment, and contraction also fell into this group, 
including DOCK7, ROCK2, IQGAP1, PRKCA, and 
AMOTLI. The observation that some cell motility 
genes correlated with vesicle transport genes is 
consistent with the role of vesicle transport in 
cell migration (33). An additional feature of group 
6 is that a subset of these genes—in particular, 
those related to cell motility—were anticorrelated 
with members of the ECM group discussed above 
(Fig. 3D). This anticorrelation may reflect regu- 
latory interactions that mediate the switching of 
cells between adherent and migratory states. 


Mapping spatial distributions of RNAs 


As an imaging-based approach, MERFISH also 
allowed us to investigate the spatial distributions 
of many RNA species simultaneously. Several pat- 
terns emerged from the visual inspection of indi- 
vidual genes, with some RNA transcripts enriched 
in the perinuclear region, some enriched in the 
cell periphery, and some scattered throughout 
the cell (Fig. 4A). To identify genes with similar 
spatial distributions, we determined the correla- 
tion coefficients for the spatial density profiles 
of all pairs of RNA species and organized these 
RNAs according to the pairwise correlations 
again using the hierarchical clustering approach. 
The correlation coefficient matrix showed groups 
of genes with correlated spatial organizations, and 
the two most notable groups with the strongest 
correlations are indicated in Fig. 4B. Group I 
RNAs appeared enriched in the perinuclear re- 
gion, whereas group II RNAs appeared enriched 
near the cell periphery (Fig. 4C). Quantitative 
analysis of the distances between each RNA mol- 
ecule and the cell nucleus or the cell periphery 
indeed confirmed this visual impression (Fig. 4D). 

Group I contained genes encoding extracel- 
lular proteins such as FBN1, FBN2, and THSB1; 
secreted proteins such as PAPPA; and integral 
membrane proteins such as LRP1 and GPR107. 
These proteins have no obvious commonalities 
in function. Rather, a GO analysis showed signif- 
icant enrichment for location terms, such as extra- 
cellular region, basement membrane, or perivitelline 
space (Fig. 4E). To reach these locations, proteins 
must pass through the secretion pathway, which 
often requires translation of mRNA at the endo- 
plasmic reticulum (ER) (34, 35). Thus, we propose 
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Fig. 3. Cell-to-cell variations and pairwise correlations for the RNA spe- 
cies determined from the 140-gene measurements. (A) Comparison of 
gene expression levels in two individual cells. (B) Fano factors for individual 
genes. Error bars represent standard error of the mean determined from seven 
independent data sets. (C) Z-scores of the expression variations of four ex- 
ample pairs of genes showing correlated (top two) or anticorrelated (bottom 
two) variation for 100 randomly selected cells. Z-score is defined as the 
difference from the mean normalized by the standard deviation. (D) Matrix of 
the pairwise correlation coefficients of the cell-to-cell variation in expression 
for the measured genes, shown together with the hierarchical clustering tree. 
The seven groups identified by a specific threshold on the cluster tree (dashed 
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line) are indicated by the black boxes in the matrix and colored lines on the 
tree, with gray lines on the tree indicating ungrouped genes. Different threshold 
choices on the cluster tree could be made to select either smaller subgroups 
with tighter correlations or larger super-groups containing more weakly cou- 
pled subgroups. Two of the seven groups are enlarged on the right. (E) Enrich- 
ment of 30 selected, statistically significantly enriched GO terms in the seven 
groups. Enrichment refers to the ratio of the fraction of genes within a group 
that have the specific GO term to the fraction of all measured genes having 
that term. Top 10 statistically significantly enriched GO terms for each of the 
seven groups are shown in table S2. Not all of the GO terms presented here are 
in the top 10 list. 
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Fig. 4. Distinct spatial distributions of RNAs observed in the 140-gene 
measurements. (A) Examples of the spatial distributions observed for four 
different RNA species in a cell. (B) Matrix of the pairwise correlation co- 
efficients describing the degree with which the spatial distributions of each 
gene pair is correlated, shown together with the hierarchical clustering tree. 
Two strongly correlating groups are indicated by the black boxes on the matrix 
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and color on the tree. (C) The spatial distributions of all RNAs in the two groups 
in two example cells. Light blue symbols, group | genes; red symbols, group II 
genes. (D) Average distances for genes in group | and genes in group II to the 
cell edge or the nucleus normalized to the average distances for all genes. Error 
bars represent SEM across seven data sets. (E) Enrichment of GO terms in 
each of the two groups. 


that the spatial pattern that we observed for 
these mRNAs reflects their cotranslational en- 
richment at the ER. The enrichment of these 
mRNAs in the perinuclear region (Fig. 4, C and D, 
light blue), where the rough ER resides, supports 
this conclusion. 

Group II contained genes encoding the actin- 
binding proteins, including filamins FLNA and 
FLNC, talin TLNI1, and spectrins SPTANI and 
SPTBN1; the microtubule-binding protein CKAP5; 
and the motor proteins MYH10 and DYNCIH1. 
This group was enriched with GO terms such as 
cortical actin cytoskeleton, actin filament bind- 
ing, and cell-cell adherens junction (Fig. 4E). It 
has been shown previously that B-actin mRNA is 
enriched near the cell periphery in fibroblasts, as 
are mRNAs that encode members of the actin- 
binding Arp2/3 complex (36, 37). The enrichment 
of group II mRNAs in the peripheral region of 
the cells (Fig. 4, C and D) suggests that the spa- 
tial distribution of the group II genes might 
be related to the distribution of actin cytoskele- 
ton mRNAs. 


Measuring 1001 genes with MERFISH 
by use of a 14-bit MHD2 code 


Last, we sought to further increase the through- 
put of our MERFISH measurement by simulta- 
neously imaging ~1000 RNA species. This increase 
could be achieved with our MHD4: code by in- 
creasing the number of bits per code word to 32 
while maintaining the number of 1 bits per word 
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at four (Fig. 1B). This could be implemented by 
either increasing the number of hybridization 
round to 32 or maintaining 16 rounds of hybrid- 
ization, but using two-color imaging in each round. 
We pursued an alternative approach that did 
not require an increase in the number of hybrid- 
izations or color channels by relaxing the error 
correction requirement but keeping the error- 
detection capability. For example, by reducing the 
Hamming distance from 4 to 2, we could use all 
14-bit words that contain four 1 bits to encode 
1001 genes and probe these RNAs with only 14 
rounds of hybridization. However, because a sin- 
gle error can produce a word equally close to two 
different code words, error correction is no longer 
possible for this modified Hamming-distance-2 
(MHD2) code. Hence, we expect the calling rate 
to be lower and the misidentification rate to be 
higher with this encoding scheme. 

To evaluate the performance of this 14-bit 
MHD2 code, we set aside 16 of the 1001 possible 
code words as misidentification controls and 
used the remaining 985 words to encode cellular 
RNAs (table S3). Among these 985 RNAs, we in- 
cluded 107 RNA species probed in the 140-gene 
experiments as an additional control. We per- 
formed the 1001-gene experiments in IMR90 cells 
by using a similar procedure as described above. 
To allow all encoding probes to be synthesized 
from a single 100,000-member oligopool, we re- 
duced the number of encoding probes per RNA 
species to ~94. Fluorescent spots corresponding 


to individual RNA molecules were again clearly 
detected in each round of hybridization with the 
readout probes, and based on their on-off patterns, 
these spots were decoded into RNA (Fig. 5A and 
fig. S9, A and B). In the cell shown in Fig. 5A, 430 
RNA species were detected, and similar results 
were obtained in ~200 imaged cells in three in- 
dependent experiments. 

As expected, the misidentification rate of this 
scheme was higher than that of the MHD4 code. 
Of all real RNA words, 77% were detected more 
frequently than the median count for the mis- 
identification controls, instead of the 95% value 
observed in the MHD4 measurements. Using the 
same confidence ratio analysis as described above, 
we found that 73% (instead of 91% for the MHD4 
measurements) of the 985 RNA species were 
measured with a confidence ratio larger than 
the maximum value observed for the misidenti- 
fication controls (fig. S9C). RNA copy numbers 
measured from these 73% RNA species showed 
excellent correlation with our bulk RNA sequenc- 
ing results (Pearson correlation coefficient r = 
0.76) (Fig. 5B, black). The remaining 27% of the 
genes still exhibit good, albeit lower, correlation 
with the bulk RNA sequencing data (7 = 0.65) 
(Fig. 5B, red), but we took the conservative mea- 
sure of excluding them from further analysis. 

The lack of an error correction capability also 
decreased the calling rate of each RNA species: 
When comparing the 107 RNA species common 
in both the 1001-gene and 140-gene measurements, 
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Fig. 5. Simultaneous measurements of 1001 RNA species in single cells by using MERFISH with a 
14-bit MHD2 code. (A) The localizations of all detected single molecules in a cell colored based on their 
measured binary words. (Inset) The composite, false-colored fluorescent image of the 14 hybridization 
rounds for the boxed subregion with numbered circles indicating potential RNA molecules. Red circles 
indicate unidentifiable molecules, the binary words of which do not match any of the 14-bit MHD2 code 
words. Images of individual hybridization round are shown in fig. S9A. (B) Scatter plot of the average copy 
number per cell measured in the 1001-gene experiments versus the abundance measured via bulk 
sequencing. The black symbols are for the 73% of genes detected with confidence ratios higher than the 
maximum ratio observed for the misidentification controls. The Pearson correlation coefficient is 0.76 with 
a P value of 3 x 10°°. The red symbols are for the remaining 27% of genes. The Pearson correlation 
coefficient is 0.65 with a P value of 3 x 10°*°. (C) Scatter plot of the average copy number for the 107 genes 


shared in both the 1001-gene measurement with the 
MHD4 code. The Pearson correlation coefficient is 
corresponds to the y = x line. 


we found that the copy numbers per cell of these 
RNA species were lower in the 1001-gene mea- 
surements (Fig. 5C and fig. S9D). The total count 
of these RNAs per cell was ~1/3 of that observed 
in the 140-gene measurements. Thus, the lack 
of error correction in the MHD2 code reduced 
the calling rate to ~30% of that of the MHD4 
code, which is consistent with the decrease in 
calling rate observed for the MHD4 code when 
error correction was not applied. As expected 
from the quantitative agreement between 140-gene 
measurements and conventional smFISH results, 
comparison of the 1001-gene measurements with 
conventional smFISH results for 10 RNA species 
also indicated a calling rate that is ~1/3 of that 
observed for the MHD4 code (fig. S8C). Despite 
the expected reduction in calling rate, the good 
correlations found between the copy numbers 
observed in the 1001-gene measurements and 
those observed in the 140-gene measurements, as 
well as in conventional smFISH and bulk RNA 
sequencing measurements, indicates that the rel- 
ative abundance of these RNAs can be quantified 
with the MHD2 encoding scheme. 
Simultaneously imaging ~1000 genes in indi- 
vidual cells substantially expanded our ability to 
detect coregulated genes. The matrix of pairwise 
correlation coefficients determined from the cell- 
to-cell variations in the expression levels of these 
genes is shown in Fig. 6A. Using the same hier- 
archical clustering analysis as described above, 
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HD2 code and the 140-gene measurement with the 
0.89 with a P value of 9 x 10°°°. The dashed line 


we identified ~100 groups of genes with corre- 
lated expression (table S4). Nearly all of these 
~100 groups showed statistically significant en- 
richment of functionally related GO terms (Fig. 
6B and table S4). These included some of the 
groups identified in the 140-gene measurements, 
such as the group associated with cell-replication 
genes and the group associated with cell-motility 
genes (Fig. 6, A and B, groups 7 and 102), as well 
as many new groups. The groups identified here 
included 46 RNA species lacking any previous 
GO annotations, for which we can now hypoth- 
esize function on the basis of their group asso- 
ciation (table S4). For example, KIAA1462 is part 
of the cell motility group, as also shown in the 
140-gene experiments, suggesting a potential role 
of this gene in cell motility (Fig. 6A, group 102). 
Likewise, KIAA0355 is part of a new group en- 
riched in genes associated with heart develop- 
ment (Fig. 6A, group 79), and Ci7orf70 is part of a 
group associated with ribosomal RNA processing 
(Fig. 6A, group 22). Using these groupings, we 
can also hypothesize cellular functions for 61 
transcription factors and other partially annotated 
proteins of unknown functions (table S4). For 
example, the transcription factors Z3CH13 and 
CHD8 are both members of the cell-motility group, 
suggesting their potential role in the transcrip- 
tional regulation of cell-motility genes. Although 
these predicted functions based on gene-association 
analysis require further validation, our covaria- 


tion data provide a resource for generating hy- 
potheses on gene function and regulation. 


Discussion 


We have developed a highly multiplexed detec- 
tion scheme for transcriptomic-scale RNA imag- 
ing in single cells. Using combinatorial labeling, 
sequential hybridization and imaging, and two 
different error-robust encoding schemes, we simulta- 
neously imaged either 140 or 1001 genes in hun- 
dreds of individual human fibroblast cells. Of the 
two encoding schemes presented here, the MHD4 
code is capable of both error detection and error 
correction and hence can provide a higher calling 
rate and a lower misidentification rate than can 
the MHD2 code, which instead can only detect but 
cannot correct errors. MHD2, on the other hand, 
provides a faster scaling of the degree of multi- 
plexing with the number of bits than can MHD4. 
Other error-robust encoding schemes can also be 
used for such multiplexed imaging, and experi- 
menters can set the balance between detection 
accuracy and ease of multiplexing according to 
the specific requirements of the experiments. 

By increasing the number of bits in the code 
words, it should be possible to further increase 
the number of detectable RNA species by using 
MERFISH with either MHD4 or MHD2 codes. 
Because of their much slower increase in error 
rates with the number of bits, we expect the error- 
correcting encoding schemes, such as MHD4, to 
be more favorable for scaling up the measurements. 
For example, using the MHD4 code with 32 total 
bits and four or six 1 bits would increase the num- 
ber of addressable RNA species to 1240 or 27,776, 
respectively; the latter is the approximate scale of 
the human transcriptome. The predicted misiden- 
tification and calling rates are still reasonable for 
the 32-bit MHD4 code (shown in Fig. 1, C and D, 
purple for the MHD4 code with four 1 bits, and 
similar rates were calculated for the MHD4 code 
with six 1 bits). If more accurate measurements are 
desired, an additional increase in the number of 
bits would allow the use of encoding schemes 
with a Hamming distance greater than 4, further 
enhancing the error detection and correction ca- 
pability. Although an increase in the number of 
bits by adding more hybridization rounds would 
increase the data collection time and potentially 
lead to sample degradation, these problems could 
be mitigated by using multiple colors to readout 
multiple bits in each round of hybridization. 

As the degree of multiplexing is increased, it is 
important to consider the potential increase in 
the density of RNAs that need to be resolved in 
each round of imaging. On the basis of our imaging 
and sequencing results, we estimate that including 
the whole transcriptome of the IMR90 cells would 
lead to a total RNA density of ~200 molecules/um*. 
Using our current imaging and analysis methods, 
we could resolve 2 to 3 molecules/um? per hy- 
bridization round (38), which would reach a 
total RNA density of ~20 molecules/um? after 
32 rounds of hybridization. This density should 
allow all but the top 10% most expressed genes to 
be imaged simultaneously or a subset of genes 
with even higher expression levels to be included. 
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Fig. 6. Covariation analysis of the RNA species measured in the 1001-gene measurements. (A) Matrix of all pairwise correlation coefficients of the cell-to- 
cell variation in expression for the measured genes shown with the hierarchical clustering tree. The ~100 identified groups of correlated genes are indicated by 
color on the tree. Zoom-in of four of the groups described in the text are shown on the right. (B) Enrichment of 20 selected, statistically significantly enriched GO 
terms in the four groups. The statistically most significantly enriched GO terms (maximum 10) for each of the ~100 groups are shown in table S4. 


By using more advanced image analysis algo- 
rithms to better resolve overlapping images of 
individual molecules, such as compressed sensing 
(39, 40), it would be possible to extend the re- 
solvable density by approximately fourfold and 
thus allow nearly the entire transcriptome, ex- 
cept for the top 2% most expressed genes, to be 
imaged all together. Last, theoretical predictions 
(17) indicate that the use of superresolution 
imaging (41, 42) could increase the resolvable 
density to ~10° molecules/um?, which should be 
ample to address the entire transcriptome, even 
in cell types with RNA densities substantially 
higher than that of IMR90. However, RNAs in 
densely packed structures, such as p-bodies and 
stress granules, may still elude measurement. 
We have illustrated the utility of the data de- 
rived from highly multiplexed RNA imaging by 
using covariation and correlation analysis to re- 
veal distinct subcellular distribution patterns of 
RNAs, to constrain gene regulatory networks, and 
to predict functions for many previously unanno- 
tated or partially annotated genes with unknown 
functions. We anticipate that many more quan- 
titative analyses could be applied to such data 
sets that include the spatial localization and copy 
number information of many RNA species in 
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individual cells. Given its ability to quantify RNAs 
across a wide range of abundances without 
amplification bias while preserving native con- 
text, we envision that MERFISH will enable 
many applications of in situ transcriptomic anal- 
yses of individual cells in culture or complex 
tissues. 


Materials and Methods 
Probe design 


Each RNA species in our target set was randomly 
assigned a binary code word either from all 140 
possible code words of the 16-bit MHD4 code or 
from all 1001 possible code words of the 14-bit 
MHD2 code, as we describe in the main text. 
The encoding schemes are provided in tables 
S1 and S3. 

We used array-synthesized oligopools as tem- 
plates to make the encoding probes (22, 23). The 
template molecule for each encoding probe con- 
tains three components: (i) a central targeting 
sequence for in situ hybridization to the target 
RNA, (ii) two flanking readout sequences de- 
signed to hybridize each of two distinct readout 
probes, and (iii) two flanking primer sequences 
to allow enzymatic amplification of the probes 


(fig. S3). The readout sequences were taken from 
the 16 possible readout sequences, each corre- 
sponding to one hybridization round. The read- 
out sequences were assigned to the encoding 
probes so that for any RNA species, each of the 
four readout sequences were distributed uni- 
formly along the length of the target RNA and 
appeared at the same frequency. Template mol- 
ecules for the 140-gene library also included a 
common 20-nucleotide (nt) priming region be- 
tween the first polymerase chain reaction (PCR) 
primer and the first readout sequence. This prim- 
ing sequence was used for the reverse transcrip- 
tion step described below. All template sequences 
are provided in table S5. 

We embedded multiple experiments in a sin- 
gle array-synthesized oligopool and used PCR to 
selectively amplify only the oligos required for a 
specific experiment. Primer sequences for this 
indexed PCR reaction were generated from a set 
of orthogonal 25-nt sequences (43). These sequences 
were trimmed to 20 nt and selected for (i) a 
narrow melting temperature range (70 to 80°C), 
(ii) the absence of consecutive repeats of 3 or 
more identical nucleotides, and (iii) the presence 
of a GC clamp—one of the two 3’ terminal bases 
must be G or C. To further improve specificity, 
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Table 1. Primer sequences used in the 140-gene and 1001-gene experiments. 


Experiment name 


140-gene codebook 1 GTTGGTCGGCACTTGGGTGC 
140-gene codebook 2 CGATGCGCCAATTCCGGTTC 
1001-gene CGCGGGCTATATGCGAACCG 


these sequences were then screened against the 
human transcriptome by using Basic Local Align- 
ment Search Tool+ (BLAST+) (44), and primers 
with 14 or more contiguous bases of homology 
were eliminated. Last, BLAST+ was again used to 
identify and exclude primers that had an 11-nt 
homology region at the 3’ end of any other 
primer or a 5-nt homology region at the 3’ end 
of the T7 promoter. The forward primer sequences 
(primer 1) were determined as described above, 
whereas the reverse primers each contain a 20-nt 
sequence as described above plus a 20-nt T7 
promoter sequence to facilitate amplification via 
in vitro transcription (primer 2). The primer se- 
quences used in the 140-gene and 1001-gene ex- 
periments are listed in Table 1. 

Thirty-nt-long readout sequences were created 
by concatenating fragments of the same orthog- 
onal primer set generated above by combining 
one 20-nt primer with a 10-nt fragment of an- 
other. These readout sequences were then screened, 
by using BLAST-+, for orthogonality with the in- 
dex primer sequences and other readout sequences 
(no more than 11 nt of homology) and for poten- 
tial off-target binding sites in the human genome 
(no more than 14 nt of homology). Fluorescently 
labeled readout probes with sequences comple- 
mentary to the readout sequences were used to 
probe these readout sequences, one in each hy- 
bridization round. All used readout probes se- 
quences are listed in Table 2. 

The readout probes used for the 140-gene 
libraries were probes 1 through 16. The readout 
probes used for the 1001-gene experiment were 
probes 1 through 14. “/3Cy5Sp/” indicates a 3’ Cy5 
modification. 

To design the central targeting sequences of 
the encoding probes, we first compiled the abun- 
dance of different transcripts in IMR90 cells 
using Cufflinks v2.1 (45), total RNA data from the 
Encyclopedia of DNA Elements (ENCODE) 
project (46), and human genome annotations 
from Gencode v18 (20). Probes were designed 
from gene models corresponding to the most 
abundant isoform by using OligoArray2.1 (47) 
with the following constraints: The target se- 
quence region is 30-nt long; the melting tem- 
peratures of the hybridized region of the probe 
and cellular RNA target is greater than 70°C; 
there is no cross hybridization targets with melt- 
ing temperatures greater than 72°C; there is no 
predicted internal secondary structures with 
melting temperatures greater than 76°C; and 
there is no contiguous repeats of six or more 
identical nucleotides. Melting temperatures were 
adjusted to optimize the specificity of these 
probes and minimize secondary structure while 
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Primer 1 sequence 
(index primer 1) 


Table 2. All used readout probes sequences. 


Primer 2 sequence 


(T7 promoter plus the reverse complement of index primer 2) 


TAATACGACTCACTATAGGGAAAGCCGGTTCATCCGGTGG 
TAATACGACTCACTATAGGGTGATCATCGCTCGCGGGTTG 
TAATACGACTCACTATAGGGCGTGGAGGGCATACAACGC 


Bit Readout probes 

1 CGCAACGCTTGGGACGGTTCCAATCGGATC/3Cy5Sp/ 
2 CGAATGCTCTGGCCTCGAACGAACGATAGC/3Cy5Sp/ 
3 ACAAATCCGACCAGATCGGACGATCATGGG/3Cy5Sp/ 
4 CAAGTATGCAGCGCGATTGACCGTCTCGTT/3Cy5Sp/ 
5 GCGGGAAGCACGTGGATTAGGGCATCGACC/3Cy5Sp/ 
6 AAGTCGTACGCCGATGCGCAGCAATTCACT/3Cy5Sp/ 
7 CGAAACATCGGCCACGGTCCCGTTGAACTT/3Cy5Sp/ 
8 ACGAATCCACCGTCCAGCGCGTCAAACAGA/3Cy5Sp/ 
9 CGCGAAATCCCCGTAACGAGCGTCCCTTGC/3Cy5Sp/ 
10 GCATGAGTTGCCTGGCGTTGCGACGACTAA/3Cy5Sp/ 
ll CCGTCGTCTCCGGTCCACCGTTGCGCTTAC/3Cy5Sp/ 
12 GGCCAATGGCCCAGGTCCGTCACGCAATTT/3Cy5Sp/ 
13 TTGATCGAATCGGAGCGTAGCGGAATCTGC/3Cy5Sp/ 
14 CGCGCGGATCCGCTTGTCGGGAACGGATAC/3Cy5Sp/ 
15 GCCTCGATTACGACGGATGTAATTCGGCCG/3Cy5Sp/ 
16 GCCCGTATTCCCGCTTGCGAGTAGGGCAAT/3Cy5Sp/ 


still producing sufficient numbers of probes for 
our libraries. To decrease computational cost, 
isoforms were divided into 1-kb regions for probe 
design. Using BLAST+, all potential probes that 
mapped to more than one cellular RNA species 
were rejected. Probes with multiple targets on 
the same RNA were kept. 

For each gene in the 140-gene experiments, we 
generated 198 putative encoding probe sequences 
by concatenating the appropriate index primers, 
readout sequences, and targeting regions as shown 
in fig S3. To address the possibility that con- 
catenation of these sequences introduced new 
regions of homology to off-target RNAs, we used 
BLAST+ to screen these putative sequences against 
all human ribosomal RNA (rRNA) and transfer 
RNA (tRNA) sequences as well as highly expressed 
genes [genes with fragments per kilobase per mil- 
lion reads (FPKM) > 10,000]. Probes with greater 
than 14 nt of homology to rRNAs or tRNAs or 
greater than 17 nt of homology to highly expressed 
genes were removed. After these cuts, we had 
~192 (with a standard deviation of 2) probes per 
gene for both MHD4 codebooks used in the 140- 
gene experiments. We followed the same proto- 
col for the 1001-gene experiments: Starting with 
96 putative targeting sequences per gene, we 
obtained ~94 (with a standard deviation of 6) 
encoding probes per gene after these additional 
homology cuts. We decreased the number of en- 
coding probes per RNA for the 1001-gene experi- 
ments so that these probes could be synthesized 
from a single 100,000-member oligopool as op- 
posed to two separate pools. We designed each 
encoding probe to contain two of the four read- 


out sequences associated with each code word; 
hence, only half of the bound encoding probes 
can bind readout probe during any given hybrid- 
ization round. We used ~192 or ~94 encoding 
probes per RNA to obtain high signal-to-background 
ratios for individual RNA molecules. The num- 
ber of encoding probes per RNA could be sub- 
stantially reduced but still allow single RNA 
molecules to be identified (17, 48, 49). In addi- 
tion, increasing the number of readout sequences 
per encoding probe or using optical sectioning 
methods to reduce the fluorescence background 
may allow further reduction in the number of 
the encoding probes per RNA. 

We designed two types of misidentification 
controls. The first control—blank words—were 
not represented with encoding probes. The sec- 
ond type of control—no-target words—had encod- 
ing probes that were not targeting any cellular 
RNA. The targeting regions of these probes were 
composed of random nucleotide sequences sub- 
ject to the same constraints used to design the 
RNA targeting sequences described above. More- 
over, these random sequences were screened 
against the human transcriptome to ensure that 
they contain no substantial homology (>14-nt) to 
any human RNA. The 140-gene measurements 
contained five blank words and five no-target 
words. The 1001-gene measurements contained 
11 blank words and five no-target words. 


Probe synthesis 


The encoding probes were synthesized by using 
the following four steps, and this synthesis pro- 
tocol is illustrated in fig. S3. 
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Step 1: The template oligopool (CustomArray) 
was amplified via limited-cycle PCR on a Bio-Rad 
CFX96 by using primer sequences specific to the 
desired probe set. To facilitate subsequent ampli- 
fication via in vitro transcription, the reverse 
primer contained the T7 promoter. All primers 
were synthesized by Integrated DNA Technol- 
ogies (IDT). This reaction was column purified 
(Zymo DNA Clean and Concentrator, D4003). 

Step 2: The purified PCR products were then 
further amplified ~200-fold and converted into 
RNA via a high yield in vitro transcription ac- 
cording to the manufacturer’s instructions [New 
England Biolabs (NEB), E2040S]. Each 20 uL re- 
action contained ~1 pg of template DNA from 
above, 10 mM of each NTP, 1x reaction buffer, 
1x RNase inhibitor (Promega RNasin, N2611) 
and 2 uL of the T7 polymerase. This reaction was 
incubated at 37°C for 4 hours to maximize yield. 
This reaction was not purified before the fol- 
lowing steps. 

Step 3: The RNA products from the above in 
vitro transcription reaction were then con- 
verted back into DNA via a reverse transcrip- 
tion reaction. Each 50-uL reaction contained 
the unpurified RNA produce from step 2 sup- 
plemented with 1.6 mM of each dNTP, 2 nmol 
of a reverse transcription primer, 300 units of 
Maxima H- reverse transcriptase (Thermo Sci- 
entific, EP0751), 60 units of RNasin, and a final 
1x concentration of the Maxima RT buffer. This 
reaction was incubated at 50°C for 45 min, and 
the reverse transcriptase was inactivated at 85°C 
for 5 min. The templates for the 140-gene libraries 
contain a common priming region for this re- 
verse transcription step; thus, a single primer 
was used for this step when creating these probes. 
Its sequence was CGGGTTTAGCGCCGGAAATG. 
A common priming region was not included for 
the 1001-gene library; thus, the reverse transcrip- 
tion was conducted with the forward primer: 
CGCGGGCTATATGCGAACCG. 

Step 4: To remove the template RNA, 20 uL of 
0.25 M EDTA and 0.5 N NaOH was added to the 
above reaction to selectively hydrolyze RNA, and 
the sample was incubated at 95°C for 10 min. 
This reaction was then immediately purified by 
means of column purification using a 100-ug- 
capacity column (Zymo Research, D4030) and 
the Zymo Oligo Clean and Concentrator proto- 
col. The final probes were eluted in 100 uL of 
ribonuclease (RNase)-free deionized water, evap- 
orated in a vacuum concentrator, and then re- 
suspended in 10 uL of encoding hybridization 
buffer (recipe below). Probes were stored at -20°C. 
Denaturing polyacrylamide gel electrophoresis 
and absorption spectroscopy were used to con- 
firm the quality of the probes and revealed that 
this probe synthesis protocol converts 90 to 100% 
of the reverse-transcription primer into full-length 
probe and of the probe that is constructed, 70 to 
80% is recovered during the purification step. 
This protocol is similar to another recently pub- 
lished protocol (23) but provides a substantially 
larger yield. 

Fluorescently labeled readout probes have se- 
quences complementary to the readout sequences 
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described above and a Cy5 dye attached at the 3’ 
end. These probes were synthesized and purified 
by means of high-performance liquid chroma- 
tography (HPLC) by IDT. 


Sample preparation and labeling 
with encoding probes 


Human primary fibroblasts (American Type Cul- 
ture Collection, IMR90), a commonly used cell 
line with a previously determined transcriptome 
(46), were used in this work. These cells are rel- 
atively large and flat, facilitating wide-field im- 
aging without the need for optical sectioning. 
Cells were cultured with Eagle’s Minimum Es- 
sential Medium. Cells were plated on 22-mm, #1.5 
coverslips (Bioptechs, 0420-0323-2) at 350,000 
cells per coverslip and incubated at 37°C with 5% 
CO, for 48 to 96 hours within petri dishes. Cells 
were fixed for 20 min in 4% paraformaldehyde 
(Electron Microscopy Sciences, 15714) in 1x phos- 
phate buffered saline (PBS; Ambion, AM9625) at 
room temperature, reduced for 5 min with 0.1% 
w/v sodium borohydride (Sigma, 480886) in wa- 
ter to reduce background fluorescence, washed 
three times with ice-cold 1x PBS, permeabilized 
for 2 min with 0.5% v/v Triton (Sigma, T8787) in 
1x PBS at room temperature, and washed three 
times with ice cold 1x PBS. 

Cells were incubated for 5 min in encoding 
wash buffer comprising 2x saline-sodium citrate 
buffer (SSC) (Ambion, AM9763), 30% v/v form- 
amide (Ambion, AM9342), and 2 mM vanadyl 
ribonucleoside complex (NEB, S1402S). Ten mi- 
croliters of 100 uM (140-gene experiments) or 
200 uM (1001-gene experiments) encoding probes 
in encoding hybridization buffer was added to 
the cell-containing coverslip and spread uniformly 
by placing another coverslip on top of the sample. 
Samples were then incubated in a humid chamber 
inside a 37°C-hybridization oven for 18 to 36 hours. 
Encoding hybridization buffer is composed of en- 
coding wash buffer supplemented with 1 mg/mL 
yeast tRNA (Life technologies, 15401-011) and 10% 
w/v dextran sulfate (Sigma, D8906-50G). 

Cells were then washed with encoding wash 
buffer, incubated at 47°C for 10 min, and this wash 
was repeated for a total of three times. A 1:1000 
dilution of 0.2-um-diameter carboxylate-modified 
orange fluorescent beads (Life Technologies, 
F-8809) in 2xSSC was sonicated for 3 min and 
then incubated with the sample for 5 min. The 
beads were used as fiducial markers to align 
images obtained from multiple successive rounds 
of hybridization, as described below. The sample 
was washed once with 2xSSC, and then post- 
fixed with 4% v/v paraformaldehyde in 2xSSC at 
room temperature for 30 min. The sample was 
then washed three times with 2xSSC and either 
imaged immediately or stored for no longer than 
12 hours at 4°C before imaging. All solutions 
were prepared as RNase-free. 


MERFISH imaging 


The sample coverslip was assembled into a 
Bioptech’s FCS2 flow chamber, and the flow 
through this chamber was controlled via a home- 
built fluidics system composed of three computer- 


controlled eight-way valves (Hamilton, MVP and 
HVXM 8-5) and a computer-controlled peristaltic 
pump (Rainin, Dynamax RP-1). The sample was 
imaged on a home-built microscope constructed 
around an Olympus IX-71 body and a 1.45 NA, 
100 oil immersion objective and configured for 
oblique incidence excitation. The objective was 
heated to 37°C with a Bioptechs objective heater. 
Constant focus was maintained throughout the 
imaging process with a home-built, autofocusing 
system. Illumination was provided at 641, 561, 
and 405 nm by using solid-state lasers (MPB 
communications, VFL-P500-642; Coherent, 561- 
200CWCDBH;; and Coherent, 1069413/AT) for 
excitation of our Cy5-labeled readout probes, 
the fiducial beads, and nuclear counterstains, re- 
spectively. These lines were combined with a 
custom dichroic (Chroma, zy405/488/561/647/ 
752RP-UFI) and the emission was filtered with 
a custom dichroic (Chroma, ZET405/488/561/ 
647-656/752m). Fluorescence was separated with 
a QuadView (Photometrics) by using the dichroics 
T560lpxr, T650lpxr, and 750dcxxr (Chroma) and 
the emission filters ET525/50m, WT59550m-2f, 
ET700/75m, and HQ770lp (Chroma) and imaged 
with an EMCCD camera (Andor, iXon-897). The 
camera was configured so that a pixel corre- 
sponds to 167 nm in the sample plane. The en- 
tire system was fully automated, so that imaging 
and fluid handling were performed for the entire 
experiment without user intervention. 

Sequential hybridization, imaging, and bleach- 
ing proceeded as follows. One milliliter of 10 nM 
of the appropriate fluorescently labeled readout 
probe in readout hybridization buffer (2xSSC; 
10% v/v formamide, 10% w/v dextran sulfate, 
and 2 mM vanadyl ribonucleoside complex) was 
flown across the sample, flow was stopped, and 
the sample was incubated for 15 min. Then 2 mL 
of readout wash buffer (2xSSC, 20% v/v form- 
amide, and 2 mM vanadyl ribonucleoside com- 
plex) was flown across the sample, flow was 
stopped, and the sample was incubated for 3 min. 
Two milliliters of imaging buffer comprising 
2xSSC, 50 mM TrisHCl pH 8, 10% w/v glucose, 
2 mM Trolox (Sigma-Aldrich, 238813), 0.5 mg/mL 
glucose oxidase (Sigma-Aldrich, G2133), and 
40 ug/mL catalase (Sigma-Aldrich, C30) was flown 
across the sample (50). Flow was then stopped, 
and then ~75 to 100 regions were exposed to 
~25 mW 642-nm and 1 mW of 561-nm light and 
imaged. Each region was 40 by 40 um. The laser 
powers were measured at the microscope back- 
port. Because the imaging buffer is sensitive to 
oxygen (51), the ~50 mL of imaging buffer used 
for a single experiment was made fresh at the 
beginning of the experiment and then stored 
under a layer of mineral oil throughout the mea- 
surement. Buffer stored in this fashion was sta- 
ble for more than 24 hours. 

After imaging, the fluorescence of the readout 
probes was extinguished via photobleaching. 
The sample was washed with 2 mL of photo- 
bleaching buffer (2xSSC and 2 mM vanadyl 
ribonucleoside complex), and each imaged re- 
gion of the sample was exposed to 200 mW of 
641-nm light for 3 s. To confirm the efficacy of 
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this photobleaching treatment, imaging buffer 
was reintroduced, and the sample was imaged 
as described above. 

The above hybridization, imaging, and photo- 
bleaching process was repeated either 16 times 
for the 140-gene measurements by using the MHD4: 
code or 14 times for the 1001-gene measurements 
by using the MHD2 code. An entire experiment 
was typically completed in ~20 hours. 

After completion of imaging, 2 mL of a 1:1000 
dilution of Hoescht (ENZ-52401) in 2xSSC was 
flown through the chamber to label the nuclei 
of the cells. The sample was then washed im- 
mediately with 2 mL of 2xSSC followed by 2 mL 
of imaging buffer. Each region of the sample 
was then imaged once again with ~1 mW of 
405-nm light. 

Because we imaged cells using wide-field im- 
aging with oblique-incidence illumination, without 
optical sectioning and z-scanning, we quantified 
the fraction of individual RNA species that was 
outside the axial range of our imaging geometry 
for six different RNA species using conventional 
smFISH. For this purpose, we optically sectioned 
these cells by collecting stacks of images at differ- 
ent focal depths through the entire depth of the 
cells. We aligned the images in consecutive focal 
planes and then computed for each cell the frac- 
tion of RNAs that were detected in the three- 
dimensional stack but not in the basal focal 
plane. We found that only a small fraction, 15 + 1% 
(mean + SEM across six different RNA species) of 
RNA molecules were outside the imaging range 
of a fixed focal plane without z-scanning. These 
measurements also confirmed that our excita- 
tion geometry illuminated the full depth of our 
cells. From an imaging perspective, any optical 
sectioning technique could be used in MERFISH 
to allow the imaging of RNAs in thicker cells or 
tissues. 


Construction of measured words 


Fluorescent spots were identified and localized 
in each image by using a multi-Gaussian-fitting 
algorithm (38) assuming a Gaussian with a uni- 
form width of 167 nm. This algorithm was used 
to allow partially overlapping spots to be distin- 
guished and individually fit. RNA spots were 
distinguished from background signal—signal 
arising from probes bound nonspecifically, by 
setting the intensity threshold required to fit a 
spot with this software. Because of variation in 
the brightness of spots between rounds of hy- 
bridization, this threshold was adjusted appro- 
priately for each hybridization round in order to 
minimize the combined average of the 1-0 and 
0—1 error rates across all hybridization rounds 
(140-gene measurements) or to maximize the 
ratio of the number of measured words with four 
1 bits to those with three or five 1 bits (1001-gene 
measurements). The location of the fiducial beads 
was identified in each frame by using a faster 
single-Gaussian fitting algorithm. 

Images of the same sample region in different 
rounds of hybridization were registered by ro- 
tating and translating the image to align the two 
fiducial beads within the same image that were 
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most similar in location after a coarse initial align- 
ment via image correlation. All images were aligned 
to a coordinate system established by the images 
collected in the first round of hybridization. The 
quality of this alignment was determined from 
the residual distance between five additional 
fiducial beads, and alignment error was typically 
~20 nm. 

Fluorescence spots in different hybridization 
rounds were connected into a single string, cor- 
responding to a potential RNA molecule, if the 
distance between spots was smaller than 1 pixel 
(167 nm). For each string of spots, the on-off 
sequence of fluorescent signals in all hybridiza- 
tion rounds were used to assign a binary word to 
the potential RNA molecule, in which 1 was as- 
signed to the hybridization rounds that con- 
tained a fluorescent signal above threshold and 0 
was assigned to the other hybridization rounds. 
Measured words were then decoded into RNA 
species by using the 16-bit MHD4 code or the 14- 
bit MHD2 code discussed in the main text. In the 
case of the 16-bit MHD4 code, if the measured 
binary word matched the code word of a specific 
RNA perfectly or differed from the code word by 
one single bit, it was assigned to that RNA. In the 
case of the 14-bit MHD2 code, only if the mea- 
sured binary word matched the code word of a 
specific RNA perfectly was it assigned to that 
RNA. To determine the copy number per cell, the 
number of each RNA species was counted in in- 
dividual cells within each 40- by 40-um imaging 
area. This number accounts for the majority but 
not all RNA molecules within a cell because a 
fraction of the cell could be outside the imaging 
area or focal depth. Tiling images of adjacent 
areas and adjacent focal planes could be used to 
improve the counting accuracy. 

In the 140-gene experiments, some regions of 
the cell nucleus occasionally contained too much 
fluorescence signal to properly identify individ- 
ual RNA spots. In the 1001-gene experiments, the 
cell nucleus generally contained too much fluo- 
rescent signal to allow identification of individ- 
ual RNA molecules. These bright regions were 
excluded from all subsequent analysis. This work 
focuses on mRNAs, which are enriched in the 
cytoplasm. To estimate the fraction of mRNAs 
missed by excluding the nucleus region, we used 
conventional smFISH to quantify the fraction of 
molecules found inside the nucleus for six dif- 
ferent mRNAs species. We found that only 5 + 2% 
(mean + SEM across six RNA species) of these 
RNA molecules are found in the nucleus. Use 
of super-resolution imaging and/or optical sec- 
tioning could potentially allow individual mole- 
cules in these dense nucleus regions to be 
identified, which will be particularly useful for 
probing those noncoding RNAs that are enriched 
in the nucleus. 


smFISH measurements of individual genes 


Pools of 48 fluorescently labeled (Quasar 670) 
oligonucleotide probes per RNA were purchased 
from Biosearch Technologies. Thirty-nucleotide 
probe sequences were taken directly from a ran- 
dom subset of the targeting regions used for the 


multiplexed measurements. Cells were fixed 
and permeabilized as described above. Ten mi- 
croliters of 250 nM oligonucleotide probes in 
encoding hybridization buffer (described above) 
was added to the cell-containing coverslip and 
spread uniformly by placing another coverslip 
on top of the sample. Samples were then in- 
cubated in a humid chamber inside a 37°C- 
hybridization oven for 18 hours. Cells were then 
washed with encoding wash buffer (described 
above) at 37°C for 10 min, and this wash was 
repeated for a total of three times. The sample 
was then washed three times with 2xSSC and 
imaged in imaging buffer by using the same im- 
aging geometry as described above for MERFISH 
imaging. 

Bulk RNA sequencing 


Total RNA was extracted from IMR90 cells cul- 
tured as above using the Zymo Quick RNA 
MiniPrep kit (R1054) according to the manufac- 
turer’s instructions. Polyadenylated [poly(A)] RNA 
was then selected (NEB, E7490), and a sequencing 
library was constructed by using the NEBNext 
Ultra RNA library preparation kit (NEB, E7530), 
amplified with custom oligonucleotides, and 
150-base pair (bp) reads were obtained on a 
MiSeq. These sequences were aligned to the hu- 
man genome (Gencode v18) and isoform abun- 
dance was computed with cufflinks (45). 


Calculation of the predicted scaling 
and error properties of different 
encoding schemes 


Analytic expressions were derived for the depen- 
dence of the number of possible code words, the 
calling rate, and the misidentification rate on N. 
The calling rate is defined as the fraction of RNA 
molecules that are properly identified. The mis- 
identification rate is defined as the fraction of 
RNA molecules that are misidentified as a wrong 
RNA species. For encoding schemes with an error- 
detection capability, the calling rate and misiden- 
tification rate does not add up to 1 because a 
fraction of the molecules not called properly can 
be detected as errors and discarded and, hence, 
not misidentified as a wrong species. These cal- 
culations assume that the probability of misread- 
ing bits is constant for all hybridization rounds 
but differs for the 1-0 and 0-1 errors. Exper- 
imentally measured average 10 and 0-1 error 
rates (10 and 4%, respectively) were used for the 
estimates shown in Fig. 1, B to D. For simplicity, 
the word corresponding to all Os was not re- 
moved from calculations. 

For the simple binary encoding scheme in 
which all possible N-bit binary words are assigned 
to different RNA species, the number of possible 
code words is 2%. The number of words that 
could be used to encode RNA is actually 2% - 
1 because the code word “00...0” does not con- 
tain detectable fluorescence in any hybridization 
round, but for simplicity the word corresponding 
to all Os was not removed from subsequent cal- 
culations. The error introduced by this approxi- 
mation is negligible. For any given word with m 
1s and N - m Os, the probability of measuring 
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that word without error—the fraction of RNAs 
that is properly called—is 


(1-21)"(1- Po)” (1) 


where p, is 1-0 error rate and pp is 01 error 
rate per bit. Because different words in this sim- 
ple binary encoding scheme can have different 
numbers of 1 bits, the calling rate for different 
words will differ if p, + po. The average calling 
rate, reported in Fig. 1C, was determined from 
the weighted average of the value of Eq. 1 for all 
words. This weighted average is 


24m JAPA)" Boy (2) 


Wmi=o\, m 


IN\» , ‘ : 
where a ) is the binomial coefficient and cor- 


responds to the number of words with 7 1 bits in 
this encoding scheme. Because in this encoding 
scheme every error produces a binary word that 
encodes a different RNA, the average misidenti- 
fication rate for this encoding scheme, reported 
in Fig. 1D, follows directly from Eq. 2: 


To calculate the scaling and error properties of 
the extended Hamming distance 4 (HD4) code, we 
first created the generator matrix for the desired 
number of data bits using standard methods (27). 
The generator matrix determines the specific words 
that are present in a given encoding scheme and 
was used to directly determine the number of en- 
coded words as a function of the number of bits. In 
this encoding scheme, the calling rate corresponds 
to the fraction of words measured without error as 
well as the fraction of words measured with a 
single-bit error. For code words with m1 bits, this 
fraction is determined by the following expression: 


el -p,)""(1 = Do)" + 
mp} (1- py)" \(1- po)” + 
(N-m)ps(1-p1)"(1-po)” ™" ~— (4) 


where the first term is the probability of not making 
any errors, the second term corresponds to the total 
probability of making one 1-0 error at any of the 
m1 bits without making any other 0—1 errors, and 
the final term corresponds to the total probability 
of making one 0-1 error at any of the N-m 0 bits 
without making any 1-0 errors. Because the num- 
ber of 1 bits can differ between words in this 
encoding scheme, the average calling rate reported 
in Fig. 1C was computed from a weighted average 
over Eq. 4 for different values of m. The weight for 
each term was determined from the number of 
words that contain m™ 1 bits as determined from 
the generator matrix described above. 

Because RNA-encoding words are separated 
by a minimum Hamming distance of 4, at least 4 
errors are required to switch one word into an- 
other. If error correction is applied, then 3 or 5 
errors could also convert one RNA into another. 
Thus, we estimate the misidentification rate from. 
all possible combinations of 3-bit, 4-bit, and 5-bit 
errors for code words with m 1 bits. Technically, 
>5-bit errors could also convert one RNA into 
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another, but the probability of making such er- 
rors is negligible because of the small per-bit er- 
ror rate. We approximate this expression with 


4 /m\(N- . . = are, 
3-2) pe =n)" =p) + 


3 /m\(N - eae i _m-(3-% 
2) 57) (Q-pr)"""A-poyh P+ 


> ee ) ping -p)"*(1-poy 
ot /\ 5-2 
(5 

The first sum corresponds to all of the ways in 
which exactly four mistakes can be made. Sim- 
ilarly, the second and third sums correspond to 
all of the ways in which exactly three or five 
mistakes can be made. Equation 5 provides an 
upper bound for the misidentification rate be- 
cause not all 3-, 4-, or 5-bit errors produce a word 
that matches or would be corrected to another 
legitimate word. Again because the number of 
1 bits can differ between words, the average mis- 
identification rate reported in Fig. 1D is calcu- 
lated as a weighted average of Eq. 5 over the 
number of words that have m 1 bits. 

To generate our MHD4 code in which the 
number of 1 bits for each code word is set to 4, we 
first generated the HD4 codes as described above, 
and then removed all code words that did not 
contain four 1s. The calling rate of this code, 
reported in Fig. 1C, was directly calculated from 
Eq. 4, but with m = 4 because all code words in 
this code have four 1 bits. The misidentification 
rate of this code, reported in Fig. 1D, was cal- 
culated by modifying Eq. 5 with the following 
considerations: (i) the number of 1 bits, 7, was 
set to 4 and (ii) errors that produce words that 
do not contain three, four, or five 1 bits were 
excluded. Thus, the expression in Eq. 5 was 
simplified to 


(4) (24) g0-nre-nis 
@ (75 *) piesa - m2)" -poy"* ys 


(3) ee *ptpba -p,)2(1- po)" 
e e *) piesa — p,)*(1-po)*" + 


(5) (12 * otra ay") 
(6) 


Again, this expression is an upper bound on the 
actual misidentification rate because not all words 
with four 1s are valid code words. 


Estimates of the 1-0 and O-1 error rates 
for each hybridization round 


To compute the probability of misreading a bit at 
a given hybridization round, we used the error- 
correcting properties of the MHD4 code. Briefly, 
the probabilities of 1-0 or 0—1 errors were de- 
rived in the following way. Let the probability of 


making an error at the 7th bit, —7th hybridization 
round—be p; and the actual number of RNA mol- 
ecules of the given species be A, then the num- 
ber of exact matches for this RNA will be 


16 
Wy, =AJ[]|(1-p;), and the number of one-bit 
i=l 


error corrected matches for this RNA corre- 
sponding to errors at the ith bit will be W; = 
A sai — p;). The p; can be directly derived 
from the ratio: W;/Ws = a Poy This ratio as- 
sumes that the 1-bit error-corrected counts were 
only generated from single-bit errors from the 
correct word and that multi-error contamination 
from other RNA words is negligible. Given that 
our error rate per hybridization round is small 
and that it takes at least three errors to convert 
one RNA-encoding word into a word that would 
be misidentified as another RNA, the above ap- 
proximation should be a good one. 

To compute the average 1-0 or 0-1 error 
probabilities for each of the 16 hybridization 
rounds, we use the above approach to calculate 
the per-bit error rates for each bit of every gene, 
sort these errors on the basis of whether they 
correspond to a 1-0 or a O->1 error, and then 
take the average of these errors for each bit 
weighted by the number of counts observed for 
the corresponding gene. 


Estimates of the calling rate for individual 
RNA species from actual imaging data 


With the estimates of the 1-0 or 0-1 error 
probabilities for each round of hybridization as 
determined above, it is possible to estimate the 
calling rate for each RNA according to the spe- 
cific word used to encoded it. Specifically, the 
fraction of an RNA species that is called correctly 
is determined by 


a ee: ae a : 
Ha -pi) + La ppl -pi) (7) 


where the first term represents the probability of 
observing an exact match of the code word and 
the second term represents the probability of ob- 
serving an error-corrected match (with 1-bit er- 
ror). The values of the per-bit error rate p; for 
each RNA species are determined by the specific 
code word for that RNA and the measured 1-0 
or 0-1 error rates for each round of hybridization. 
If the code word of the RNA contains a 1 in the 7th 
bit, then p; is determined from the 1-0 error rate 
for the ith hybridization round; if the word con- 
tains a 0 in the ith bit, p; is determined from the 
0—1 error rate for the ith hybridization round. 


Hierarchical clustering analysis of the 
co-variation in RNA abundance 


Hierarchical clustering of the covariation in gene 
expression for both the 140- and 1001-gene ex- 
periments was conducted as follows. First, the 
distance between every pair of genes was deter- 
mined as 1 minus the Pearson correlation coef- 
ficient of the cell-to-cell variation of the measured 
copy numbers of these two RNA species, both 
normalized by the total RNA counted in the cell. 
Thus, highly correlated genes are “closer” to 
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one another, and highly anticorrelated genes are 
“further” apart. An agglomerative hierarchical 
cluster tree was then constructed from these dis- 
tances using the unweighted pair group method 
with arithmetic mean (UPGMA). Specifically, start- 
ing with individual genes, we constructed hier- 
archical clusters by identifying the two clusters 
(or individual genes) that are closest to one an- 
other according to the arithmetic mean of the 
distances between all intercluster gene pairs. The 
pairs of clusters (or individual genes) with the 
smallest distance are then grouped together, and 
the process is repeated. The matrix of pairwise 
correlations was then sorted according to the 
order of the genes within these trees. 

Groups of genes with substantial covariations 
were identified by selecting a threshold on the 
hierarchical cluster tree (indicated by the dashed 
lines in Figs. 3D and 6A) that produced ~10 
groups of genes, each of which contains at least 
four members for the 140-gene experiments or 
~100 groups each of which contains at least three 
members for the 1001-gene experiments. One 
can change the threshold in order to identify ei- 
ther more tightly coupled smaller groups or larger 
groups with relatively loose coupling. 

A probability value for the confidence that a 
gene belongs to a specific group was determined 
by computing the difference between the average 
correlation coefficient between that gene and all 
other members of that group and the average 
correlation coefficient between that gene and all 
other measured genes outside that group. The 
significance (P value) of this difference was de- 
termined with the student’s ¢ test and is provided 
in tables S2 and S4. 

Because hierarchical clustering is inherently a 
one-dimensional analysis—any given genes can 
only be a member of a single group—this analysis 
does not allow all correlated gene groups to be 
identified. Higher-dimension analysis, such as 
principal component analysis or k-means clus- 
tering, could be used to identify more covarying 
gene clusters (30). 


Analysis of RNA spatial distributions 


To identify genes that have similar spatial dis- 
tributions, we subdivided each of the measured 
cells into 2 by 2 regions and calculated the frac- 
tion of each RNA species present in each of these 
bins. To control for the fact that some regions of 
the cell naturally contain more RNA than others, 
we calculate the enrichment for each gene—the 
ratio of the observed fraction in a given region 
for a given RNA species to the average fraction 
observed for all genes in that same region. For 
each pair of RNA species, we then determined 
the Pearson correlation coefficient of the region- 
to-region variation in enrichment of these two 
RNA species for each cell and averaged the cor- 
relation coefficients over ~400 cells imaged in 
seven independent data sets. We then clustered 
RNA species on the basis of these average cor- 
relation coefficients using the same hierarchical 
clustering algorithm described above. Because of 
the large number of cells used for the analysis, 
we found that the coarse spatial binning (2 by 2 
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regions per cell) was sufficient to capture the spa- 
tial correlation between genes, and finer binning 
did not produce more significantly correlated 
groups. 

To measure the distances of genes from the 
nuclei and from the cell edge, we first used bright- 
ness thresholds on our cell images to segment 
the nuclei and identify the cell edge. We then 
measured the distance from every RNA molecule 
to the nearest part of the nucleus and nearest 
part of the cell edge. For each data set, we com- 
puted the average distance for each RNA species 
averaged over all the cells measured. We then 
averaged these distances for the group I genes, 
group II genes, or all genes. Only those RNA 
species with at least 10 counts per cell were used 
in this analysis to minimize statistical error on 
the distance values. 


GO analysis 


Groups of genes were selected from the hierar- 
chical trees as discussed above. A collection of 
GO terms (37) was determined for all measured 
RNA species as well as the RNA species asso- 
ciated with each group from the most recent hu- 
man GO annotations (http://geneontology.org/ 
page/download-annotations) by using both the 
annotated GO terms and terms immediately up- 
stream or downstream of the found annotations. 
The enrichment of these annotations was calcu- 
lated from the ratio of the fraction of genes 
within each group that have this term to the 
fraction of all measured genes that have this 
term and the P value for this enrichment was 
calculated via the hypergeometric function. Only 
statistically significantly enriched GO terms with 
a P value less than 0.05 were considered. 
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SUPERNOVAE 


Old supernova dust factory revealed 
at the Galactic center 


R. M. Lau,’* T. L. Herter,’ M. R. Morris,” Z. Li,® J. D. Adams’** 


Dust formation in supernova ejecta is currently the leading candidate to explain the 

large quantities of dust observed in the distant, early universe. However, it is unclear 
whether the ejecta-formed dust can survive the hot interior of the supernova remnant 
(SNR). We present infrared observations of ~0.02 solar masses of warm (~100 kelvin) 
dust seen near the center of the ~10,000-year-old Sagittarius A East SNR at the Galactic 
center. Our findings indicate the detection of dust within an older SNR that is expanding 
into a relatively dense surrounding medium (electron density ~10° centimeters~°) and 
has survived the passage of the reverse shock. The results suggest that supernovae may be 
the dominant dust-production mechanism in the dense environment of galaxies of the 


early universe. 


he search for the dominant formation mech- 
anism of large quantities of dust detected 
in galaxies of the early universe remains 
an open investigation of profound impor- 
tance. It both influences observed emission 
and greatly affects the formation of future gen- 
erations of stars. Due to the short lifetimes of 
their progenitor stars and to their highly metal- 
enriched ejecta, supernovae (SNe) are believed 
to be efficient sources of dust production (J). 
However, the powerful supernova (SN) explo- 
sions and the resulting shocks are predicted to 
also be very effective at destroying and shatter- 
ing dust: Depending on the energy of the explo- 
sion and the density of the surrounding medium, 
less than ~20% of the mass of the SN-condensed 
dust is expected to survive the passage of the 
reverse shock that is driven back into the ejecta 
due to the difference between the thermal pres- 
sure of the shocked circumstellar material and 
that of the expanding ejecta (2-4). In fact, re- 
cent studies have argued that SNe may be net 
destroyers of dust in present-day galaxies [e.g., 
in the Magellanic clouds (5, 6)] but net producers 
of dust in the earliest-forming galaxies in the 
universe (7). However, no direct observational 
evidence currently exists of the quantities of SN- 
condensed dust surviving the passage of the 
reverse shock through the ejecta. 
Sagittarius A (Sgr A) East is the well-studied 
remnant of a core-collapse SN, located 8 kpc 
away near the center of our Galaxy (8), that has 
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blown out a ~3' (7-pc)-diameter shell of non- 
thermal radio emission (Fig. 1) (9, 10). Based 
on the kinematics of maser spots immediately 
surrounding the supernova remnant (SNR), it is 
estimated to be ~5 pc from the Galactic center 
(i). An age of ~10* years has been estimated 
for the SNR, based on predictions of its elon- 
gation from tidal forces (12) and, more recently, 
from the observed proper motion and displace- 
ment of the neutron star believed to be the 
remnant of the SN progenitor from the interior 
of the SNR (13). The ~10*-year time scale and the 
metal-rich hot ejecta from the center of the SNR 
imply that the reverse shock has reached the 
center of Sgr A East (74). Observations of dust 
associated with the Sgr A East SNR could there- 


fore provide estimates of the fraction of dust 
that survives the destructive passage of the re- 
verse shock. 

We use mid- and far-infrared (IR) images of 
Sgr A East from the Stratospheric Observatory 
for Infrared Astronomy (SOFIA) to argue the pres- 
ence of warm (7, ~ 100 K) dust near its center. 
The warm and dusty environment of the Galactic 
center (15) presents a challenge in definitively 
demonstrating that the IR emission is associated 
with the Sgr A East SNR ejecta. Our claim is sub- 
stantiated by four major results from the obser- 
vations: (i) The location of the IR-emitting region 
is consistent with the center of the SNR and is 
spatially anticorrelated with the hard x-ray emis- 
sion, suggesting that dust is in a cooler region of 
the ejecta. (ii) Analysis of the dust temperatures 
and heating sources shows that the location con- 
straints are consistent with Sgr A East because 
the IR-emitting region must be radiatively heated 
by the optical and ultraviolet photons from the 
central stellar cluster, which dominates the ra- 
diation field in the central parsecs of the Galaxy. 
(iii) The lack of cold dust emission at submilli- 
meter wavelengths coincident with the IR-emitting 
region implies that it does not originate from 
a cloud along the line of sight. (iv) We show that 
the typical size of the emitting grains must be 
smaller than the dust in the surrounding inter- 
stellar medium (ISM), which is consistent with 
having been processed by the reverse shock of 
the SN. 

We propose that the dust has survived the 
passage of the reverse shock due to the relatively 
large density of the surrounding medium, which 
slows the ejecta and places it into a temperature 
and density regime where it will undergo sub- 
stantial radiative cooling on time scales much 
shorter than dust-destruction time scales. With 
an estimate of the surviving dust mass in Sgr 
A East, we discuss the viability of SNe as the 


Fig. 1. Multiwavelength imaging of the Sgr A East supernova remnant. (A) Composite false-color 
image of the Sgr A East SNR overlaid with contours of the 31.5-m emission east of the CND. North is up 
and east is left. The colors correspond to emission from 2 to 8 keV (blue; Chandra/ACIS-l), 160 um (green; 
Herschel/PACS), and 6 cm (red; Very Large Array). The dotted ellipse delineates the boundary of the Sgr A 
East radio shell, with the cross indicating the location of its apparent center. (B) Fe Ka (6.7 keV; blue) 
emission from the SNR overlaid with the 31.5-um emission contours and a twice-magnified inset of the 
SNR's IR-emitting region (orange). 
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dominant source of dust production in the early 
universe. 


Observations and results 


The Faint Object Infrared Camera for the SOFIA 
Telescope (FORCAST) (16) was used to obtain 
images of the Sgr A East SNR at 7.7, 19.7, 25.2, 
31.5, and 37.1 um. The spatially resolved hot and 
warm dust traced by these wavelength bands 
is shown in Fig. 2. We used the Chandra/Advanced 
CCD Imaging Spectrometer imaging array (ACIS-1) 
to obtain high-quality hard x-ray (2 to 8 keV) im- 
ages of Sgr A East. Additionally, we incorporated 
archival mid-IR (5.8 and 8.0 um) and submilli- 
meter (70 and 160 um) observations taken by the 
Spitzer Space Telescope’s Infrared Array Camera 
(IRAC) (7, 18) and the Herschel Space Observ- 
atory’s Photodetector Array Camera and Spec- 
trometer (PACS) (19, 20), respectively (21). 


Source morphology and location 


The mid-IR (5.8 and 8 um) and far-IR (19 to 37 um) 
observations of the proposed SNR dust emission 
are cospatial and confined to the regions near 
the center of the Sgr A East radio shell (Fig. 1A). 
The central position of the dust emission is con- 
sistent with the expected location of dust having 
condensed within the ejecta. Furthermore, the 
dust is spatially anticorrelated with the hard 
x-ray continuum of the SNR ejecta (Fig. 1B). This 
anticorrelation suggests that the dust is located 
in a much cooler and less hostile region of the 
ejecta. 

It is apparent in Fig. 1A that there is no major 
submillimeter emission coincident with the lo- 
cation of the IR-emitting region. This is in con- 
trast to the Sgr A East HII regions that are located 
~3 pc to the east in projection from the center 


of the SNR and are the illuminated edges of the 
molecular cloud associated with the prominent 
ridge of submillimeter emission that extends 
along the eastern side of Sgr A East (22). The 
lack of submillimeter emission at the IR-emitting 
region therefore implies that it is not associated 
with a cold molecular cloud along the line of 
sight toward the center of Sgr A East. 


Observed dust temperature 


A key to understanding whether the dust is lo- 
cated interior to the Sgr A East SNR is its ther- 
mal structure. The closest known stellar heating 
source is the central cluster of massive young 
stars surrounding the Galactic black hole, lo- 
cated ~3 pc away in projection. We also consider 
heating via collisions with thermal electrons 
in the shocked ejecta of the SNR but conclude 
that it is negligible (23). The presence of the 
strong radiation field from the central cluster 
therefore presents a distinctive heating sce- 
nario for dust in Sgr A East because the radia- 
tive heating contribution, which typically arises 
from the interstellar radiation field in most 
SNRs, is usually negligible compared with colli- 
sional heating. 

To investigate the heating source(s) and thermal 
structure of the possible SNR dust, we generate 
a color temperature map using the ratio of the 
deconvolved 31.5- and 37.1-um (31/37) flux maps 
of Sgr A West and East observed by SOFIA/ 
FORCAST (Fig. 3) (24, 25). We assume the dust 
emission is optically thin and takes the form 
F,<B,(Ta)v* (A, flux density; B,, function for 
blackbody radiation; v, frequency at which the 
dust is emitting; B, frequency power-law index), 
where a value of 2 is adopted for B, which is typ- 
ical for interstellar dust. The longer-wavelength 


images are used to produce the color temper- 
ature map because the signal-to-noise ratio from 
the IR-emitting region at those wavelengths is 
higher than at 19.7 um. The average color tem- 
perature derived from the ratio of the 19.7- and 
37.1-um (19/37) fluxes of the IR-emitting re- 
gion is consistent with the average 31/37 color 
temperature. 

It is apparent from the location of the tem- 
perature peak and the negative radial tempera- 
ture gradient centered on Sagittarius A* (Sgr A*) 
that the luminous central cluster dominates the 
heating of the dust in the HII region and cir- 
cumnuclear disk (CND) immediately surround- 
ing Sgr A* (24). The proposed SNR dust exhibits 
a temperature of ~100 + 8 K (26), which is much 
greater than the ~75 K that would be predicted 
for 0.1-um-sized grains from the radial temper- 
ature gradient centered approximately on Sgr A*, 
allowing for a \/2 projection factor. We note that 
the ~75-K temperature of the structure we refer 
to as the Northern Dust Cloud (see Fig. 3) is con- 
sistent with this gradient and is equidistant from 
Sgr A* in projection with the SNR dust. The four 
Sgr A East HII regions seen to the east of the 
SNR (Figs. 1 and 3) exhibit temperature maxima, 
but they are each heated locally by sources of 
10* to 10° Le (Lo, solar luminosity) (22, 25). 


Heating source and dust size 


Three different scenarios can be considered to 
explain why the proposed SNR dust is at a con- 
siderably higher temperature than expected 
from heating by the central stellar cluster: (i) 
The dust is locally heated by a luminous stellar 
source or sources, (ii) the dust is heated by en- 
ergetic electron collisions in the shocked ejecta, 
or (iii) the dust is much smaller in size than the 
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Fig. 2. Mid- and far-IR images of the IR-emitting region. (A to H) Spitzer/IRAC, SOFIA/FORCAST, and Herschel/PACS flux maps the Sgr A East SNR dust. 
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dust associated with the CND and surrounding 
ISM. The first scenario would probably imply 
that the dust is associated with an illuminated 
molecular cloud along the line of sight toward 
Sgr A East, whereas the latter two scenarios would 
constrain the location of the dust to the interior 
of Sgr A East. 

First, we consider dominant heating by a local 
stellar source, as is the case for the Sgr A East HII 
regions. Given the size of the SNR’s IR-emitting 
region (~0.8 pc), we assume that any such local 
heating source(s) would be located at a distance 
of ~0.4 pc from the dust. For 0.1-um-sized silicate 
grains that are in thermal equilibrium with the 
radiation field, the total stellar luminosity of the 
heating source would then have to be ~2 x 10° Lo 
to reproduce dust temperatures of ~100 K. A 
source with such a high luminosity, which is 
similar to that of luminous blue variables, would 
be easily detected in the near-IR (1.90 um) (27, 28) 
and near-IR Spitzer/IRAC (18) observations of 
the region; however, no such heating source can 
be identified within several parsecs of the dust. 
Additionally, the total integrated infrared lumi- 
nosity of the IR-emitting region is ~7 x 10* Lo, 
which is too high if it were heated by a group of 
dusty asymptotic giant branch stars. Conversely, 
this total infrared luminosity is too low and the 
temperatures are too uniform for the IR emis- 
sion to be associated with extremely massive, 
evolved dust-enshrouded sources, such as Wolf- 
Rayet stars (L, > 10° Lo, where L, is the stellar 
luminosity). We therefore rule out the possibility 
that a local heating source is responsible for the 
observed dust temperatures. 

Dust heating due to collisions with electrons 
is commonly observed in SNRs, given the ener- 
getic conditions of the ejecta (29). Because of 
the apparent location of the dust outside the 
regions of hard x-ray emission (see Fig. 1B), 
the dust will be in a cooler and denser region 


Fig. 3. SOFIA/FORCAST flux and temperature maps of Sgr A East and West. 
(A) False-color image of dust emission from the Sgr A West and East regions. The 
colors correspond to 19.7 um (blue), 31.5 um (green), and 371 um (red), and the 
31.5-um flux contours east of the CND are overlaid. The central cluster is located in 
the subparsec vicinity of Sgr A* N, north; E, east; W, west; SE, southeast; NC, north 
clump; SC, south clump. (B) 31/37 dust temperature map of the Sgr A West and 
East regions. Overlaid are the theoretical dust temperature contours at 75 K (red) 
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of the ejecta. Taking the electron density and 
temperature to be ~100 cm™ and ~10° K, respec- 
tively, consistent with conditions that result in 
a short ejecta-cooling time scale (t.o.) << 10* 
year), we find that the radiation by the central 
cluster will dominate the heating over electron 
collisions for all grain sizes. 

In the final scenario, we propose that the 
grains composing the proposed SNR dust are 
smaller than the typical ~0.1-m size (24) and are 
therefore heated to higher temperatures due to 
lower heat capacities. The 31/37 dust temperature 
map in Fig. 3 is overlaid with several contours 
of the predicted dust equilibrium temperatures 
for 0.1-1m silicate and 0.01-um amorphous carbon 
grains, where we have only considered radiative 
heating from the central cluster and assumed 
no blockage by the CND. The 0.1-um dust tem- 
perature contours show clear agreement with 
the observed temperatures in the CND as well as 
the Northern Dust Cloud; however, the contours 
underpredict the temperatures observed for 
the proposed SNR dust. Shifting to the smaller 
0.01-um-sized grains shows much closer agree- 
ment between the predicted and observed dust 
temperatures (~100 K). 

We conclude that the inconsistency between 
the observed dust temperature of the proposed 
SNR dust and the Northern Dust Cloud and CND 
is due to a composition of smaller grains, which is 
consistent with expectations for dust in a SNR 
in which fragmentation and thermal sputtering 
have taken place after grain formation. 


Dust spectral energy distribution models 


The observed spectral energy distribution (SED) 
provides important constraints on the size of 
the dust particles, their physical location and 
the heating source(s). With this in mind, we in- 
vestigate six different regions in and around 
the SNR dust (Fig. 4) and use the DustEM code 


31/37 Dust Temperature 


——— 100 K (agy = 0.1 pzm) 
——— 75 K (asq = 0.1 pam) 
—-— 100 K (dance = 0.01 ppm) 


(30) to produce dust models that fit the ob- 
served SEDs. Three of the regions cover sites of 
prominent far-IR emission: north clump, south 
clump, and east. The remaining three regions 
cover sites of strong emission observed at mid- 
IR and submillimeter wavelengths that may 
or may not be associated with the SNR: south- 
east, west (W), and north. For the models, we 
perform a linear least-squares fit to the dered- 
dened fluxes observed at 5.8, 8.0, 19.7, 25.2, 31.5, 
and 37.1 um. We assume that the dust is com- 
posed of two independent grain distributions: 
~0.04-um-sized [large grain (LG)] and ~0.001-um- 
sized [very small grain (VSG)] amorphous car- 
bon grains heated radiatively by the central 
cluster, which we modeled as a point source 
with a luminosity of 4x 10’ Le (82) and the 
spectrum of a stellar atmosphere with an ef- 
fective temperature of 37,000 K (32), represent- 
ative of the luminous stars that dominate the 
radiation from the cluster. Because the distance 
between the central cluster and the proposed 
SNR dust (d) is uncertain, we allow d to vary as a 
free parameter in our models. The best-fit models 
(Fig. 4 and Table 1) show that d is consistent with 
the ~5-pc separation distance between the SNR 
and Sgr A* estimated by kinematics of maser 
spots associated with the SNR (Z2). 

The LG and VSG mass abundances are the 
other two free parameters for the models, in ad- 
dition to the distance between the central cluster 
and the dust. Assuming a distinctly larger-sized 
(0.1-um) dust distribution results in very poor 
SED fits for all of the regions except W. We there- 
fore require an independent distribution of VSGs 
for which transient heating allows us to fit the 
5.8- and 8.0-um flux points. A modified black- 
body fit to the 5.8- and 8.0-um points yields tem- 
peratures of ~350 K, which cannot be achieved 
if the dust is equilibrium-heated. The results from 
our best-fit models show that the VSG-to-LG 


and 100 K (orange) for O.1-um-sized silicate grains and at 100 K (green) for 
0.01-um-sized amorphous carbon grains, assuming equilibrium radiative heating 
by the central cluster. The apparent linear structure northeast of Sgr A* seen in 
the temperature contours is an artifact from combining the images of Sgr A East 
and Sgr A West, and the temperature peak at the south of Sgr A West is probably 
associated with an embedded source that appears more prominently in the mid- 
IR Spitzer/IRAC images (18). In both panels, north is up and east is left. 
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Fig. 4. Spectral energy distribution models fit to the measured flux 
throughout the IR-emitting region. (A to C) Sgr A East dust emission region. 
North is up and east is left. Overlaid are the six locations across the region where 
SEDs were extracted. The aperture size used to extract the fluxes is shown 
in the lower right corner. Fluctuations in the amplitudes of the background 
emission in the 70-um Herschel/PACS image can vary on the order of the 


mass abundance ratio for all regions except W 
ranges from 15 to 90% (Fig. 4), which suggests a 
VSG enhancement when compared with the typ- 
ical ratio of the Milky Way ISM (~13%) (33), as 
well as that of the compact Sgr A East HII re- 
gions in the 50 km s” cloud ~3 pc east of the 
proposed SNR dust (~2 to 4%) (25). 

To produce adequate fits to the SED of region 
W, we are required to either increase d to ~10 pc 
or increase the larger grain size to 0.2 um. Be- 
cause d ~ 10 pc would suggest that the region W 
dust is located twice as far away as the dust at 
the other regions, we adopt the more likely inter- 
pretation that the dust grains are larger. Regard- 
less of interpretation, the resulting inconsistent 
model fits to the region W SED strongly suggest 
that region W is not associated with the other 
regions and might thus be an extension of the 
dust distribution of the CND, as its proximity to 
the CND might suggest. 
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sition of the LG distribution, which can also be 
modeled with silicate grains. Substituting amor- 
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source flux. Fluxes extracted from the 70-um image are therefore treated as 
upper limits. (D to I) Best-fit DustEM models of the six regions. The blue 
dotted line and the red dashed line correspond to emission from VSG and LG 
distributions, respectively. Blue inverted triangles denote upper limits of the 
70-um flux measured by Herschel/PACS. Error bars indicate + 20% of the 
flux measured from Spitzer/IRAC and SOFIA/FORCAST images. Jy, jansky. 


Table 1. Assumed and derived dust properties of the dust SED model fit to the Sgr A East 
SNR dust. ase and als, are the amorphous carbon grain sizes assumed for the VSG and LG 
components, respectively, of the model, given in units of micrometers. The LG component for the 
model fit to the West region is composed of silicates, like the dust in the CND (24). Ty is the 
temperature of the LG distribution, d is the distance between the dust in the region and the central 
stellar cluster, Myo; is the total dust mass fit to the region, and Mysc/Mig is the mass ratio of the 
VSGs to the LGs. Errors provided are lc, as determined from the weighted linear least-squares fits. 


d (pc) Mrot(x1073M 5) Mvsc/Mic 
3.943 11% eneau eri 
4.3¢4 2.613 0.457908 
6.077 535 0.597005 
7.3172 2.91? 0.427008 
3.6103 0.9°33 0.157%2.0? 
4.408 9.2148 0.047001 


phous carbon for silicates in the models decreases 
the predicted total dust mass by only ~10%. How- 
ever, silicates for the VSG distribution are ruled 
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out, as they yield poor fits due to the steepness 
of the shorter-wavelength side of the 9.7-um fea- 
ture. Altering the size of the LG distribution by 
+0.02 um changes the total mass by only ~20%. 
Polycyclic aromatic hydrocarbons are also ruled 
out for the VSG composition, based on the ratio 
of the 7.7- and 8-um fluxes (23). The resulting 
SED dust models and the fitted parameters are 
shown in Fig. 4 and Table 1, respectively. 


Submillimeter excess and dust mass 


In our model fitting, we omit the 70-um flux to 
determine whether this flux is associated with 
the warm dust emission probed by the far-IR 
or is enhanced relative to the model due to the 
presence of a cooler dust distribution. Owing to 
the high background fluctuations at 70-um im- 
ages throughout the Galactic center, we assume 
that the extracted 70-um fluxes shown in the 
SEDs (Fig. 4) are upper limits. The absence of 
a '70-um excess indicates that it is unlikely to be 
associated with a molecular cloud along the line 
of sight toward the SNR. 

The dust model fit to the SED of the full re- 
gion covering the proposed SNR dust yields 
a total dust mass of ~0.021). 508 Mo (Mo, solar 
mass), where the LG component composes ~45% 
of the total mass. This mass estimate is consistent 
with the summed dust mass of the five sam- 
pled regions, excluding W (~0.013 Mo), as they 
cover approximately three-quarters of the an- 
gular size of the SNR dust emission. 


Sgr A East expansion and dust survival 
in a nonuniform-density medium 


The location, thermal structure, and SED of the 
dust indicate that it is very likely associated with 
the Sgr A East SNR. We describe a theoretical 
framework of the evolution of the SN ejecta to ex- 
plain the morphology, size composition, and ap- 
parent location of the surviving dust within the 
SNR. Our interpretation is based on comparing the 
ejecta-cooling time scales to the dust-destruction 
time scales due to thermal sputtering during the 
expansion of the ejecta into an asymmetric sur- 
rounding medium. 

Molecular line observations toward Sgr A East 
reveal the presence of dense molecular clouds that 
appear to be interacting with the SNR, especially 
at the northern and eastern edges of the rem- 
nant (9, 34). Newly condensed dust from the ini- 
tial explosion is assumed to be present uniformly 
throughout the SN ejecta, and the densities of 
the surrounding medium to the northeast (NE) 
and southwest (SW) are calculated as ~10° cm™? 
(14) and ~100 cm”, respectively. This ambient 
density gradient implies that there will be an 
asymmetry in the ejecta densities, as well as in 
the strength and speed of the forward and re- 
verse shock, which will influence the survival 
and destruction time scales of dust within the 
ejecta. After passage of the reverse shock, the 
density of the ejecta to the NE will therefore 
be ~10 times greater than the ejecta to the SW, 
and the shocked gas temperatures to the NE 
will be ~5 times lower than that to the SW. Hard 
x-ray observations of the Sgr A East ejecta re- 
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veal an electron temperature of T. ~ 2 x 107K 
and density of n. ~ 10 cm™ (14, 35). This hot 
ejecta can be associated with the diffuse SW 
expansion (Fig. 1B), whereas the NE ejecta, 
which is located at the projected position of the 
observed dust, will be unobservable in hard x-ray 
emission, given an initial temperature of T, ~ 
4 x 10° K and density of n. ~ 100 cm™. 

The time scale for complete destruction due to 
thermal sputtering for a@ = 0.04-um-sized grains 
is ~3000 years in the hot, diffuse region of the 
ejecta and ~1000 years in the cooler, dense region 
(23). These time scales do not include the erosion 
effects due to the kinetic sputtering that occurs 
when the SN-condensed dust initially encounters 
the reverse shock (3). These effects include calcu- 
lations beyond the scope of this paper; therefore, 
our estimates apply to dust that has survived 
the initial kinetic sputtering and can be treated 
as an upper limit on the dust lifetimes. Given the 
estimated age of the SNR, this destruction time 
scale implies that a large fraction of the LGs (a ~ 
0.04 um) and VSGs (a@ ~ 0.001 um) we adopt for 
our SED models will be destroyed. However, un- 
like the diffuse, hot SW ejecta, the dense, cooler 
ejecta at the NE undergoes substantial radia- 
tive cooling that occurs on time scales much 
shorter than the dust-sputtering lifetimes. 

In the metal-enriched environment of SN ejecta, 
iron will dominate the radiative cooling at tem- 
peratures greater than 10° K (36). Assuming that 
there is ~0.15 Mo of iron within the ejecta (35), 
with density in the NE 10 times that in the SW, 
and that the ejecta is in collisional ionization 
equilibrium in each region, we estimate the cool- 
ing time scale of the NE ejecta to be Teo) ~ 400 
years, which is much shorter than the estimated 
sputtering time scale. Assuming the NE ejecta 
cools to as low as To ~ 10° K, the destruction time 
scales are ~10° years for a = 0.04-.m-sized grains, 
which is sufficiently long—even for VSGs—to sur- 
vive within the ejecta. As the ejecta cools to even 
lower temperatures, the thermal sputtering be- 
comes negligible. Toward the SW, we find that 
the cooling time scale for the ejecta is t.9,) ~ 3 x 
10* years, consistent with the detection of hard 
x-rays there, given the estimated age of the SNR. 
Unfortunately, the high levels of extinction toward 
Sgr A East does not currently allow for obser- 
vations of optical cooling lines or soft x-ray 
emission to directly confirm our hypothesis. How- 
ever, the prominent spatial anticorrelation of 
the dust emission and the hard x-ray emission 
in Sgr A East strongly suggests that dust-survival 
conditions are much more favorable in cooler 
regions of the ejecta (see Fig. 1B). 

The SN-condensed dust is then injected into 
the surrounding ISM, with minimal erosion from 
thermal sputtering and nonthermal kinetic sput- 
tering. Assuming that the progenitor of Sgr A 
East was comoving with the surrounding me- 
dium, the relative velocity between the dust and 
the ISM can be estimated to be ~100 km s™, based 
on the age of the SNR (~10* years) and the ap- 
proximate distance traveled by the SN-condensed 
dust from the apparent center of the SNR (~1 pc). 
After the dust has been slowed from 100 km s* 


to 10 km s” by collisions with the surrounding 
ISM, the decrease in grain radius due to kinetic 
sputtering will be less than 10% (3, 37). 


Grain-grain collisions and the VSG-LG 
mass ratio 


The temperatures of the NE ejecta fall in a re- 
gime where grain-grain collisions become im- 
portant, if we assume that the thermal velocities 
of the dust grains are closely coupled to those of 
the gas. Unlike sputtering by ions, which erodes 
the grain atom by atom, grain-grain collisions 
are efficient at redistributing mass from larger 
(a ~ 0.1 um) to smaller (a < 0.005 um) grains 
by shattering or fragmentation (38). We attribute 
the enhanced VSG-LG mass ratio of the SNR dust 
derived from the SED models (Fig. 4 and Table 1) 
to grain-grain collisions, which occur between 
0.005- and 0.1-um grains over ~60 years (23). 
Therefore, grain-grain collisions and the fragmen- 
tation of larger grains into smaller grains occur 
on time scales shorter than the age of the SNR 
and the dust-sputtering lifetimes, which should 
lead to a substantial enhancement in the small- 
grain mass abundance relative to that of the inter- 
stellar medium, consistent with our models. 


SN dust mass-survival fraction 
and implications for galaxies 
of the early universe 


Our results show that, given a dense surround- 
ing environment, SN-condensed dust can sur- 
vive the destructive passage of the reverse shock 
to be injected into the ISM. The mass of the Sgr 
A East SNR dust provides an estimate of the 
mass-survival fraction of dust initially condensed 
in the ejecta. SNe dust-production models for a 
progenitor with a mass of 13 to 20 Mo predict 
that ~0.3 M.~ of dust forms in the ejecta (2, 3); 
therefore, our derived SNR dust mass of ~0.2 Mo 
implies that 7% of the total initial dust mass 
survived the passage of the reverse shock. This 
number is very uncertain, however, because esti- 
mates of dust masses produced in SNe are not 
well constrained, given the uncertainties in the 
microphysics that dictate grain formation. 

Excluding SN1987A, infrared observations of 
SNRs much younger than Sgr A East reveal that 
far smaller quantities of dust are formed in the 
ejecta than predicted by models (<0.1 Mo) [(39) 
and references therein]. If we assume that 0.1Mo 
of dust is initially formed in SNe, which is the 
quantity of dust detected in Cassiopeia A and the 
Crab Nebula (39), our results imply that 20% of 
the initial dust mass survives the reverse shock to 
be injected into the ISM. 

We apply our results to the dust-formation 
and -survival rates in the ISM of the galaxies in 
the early universe. In a scenario where the gal- 
axy’s star-formation history undergoes a single 
short and intense burst followed by a calm pe- 
riod with a much lower rate of star formation, 
a dust yield of ~0.15 Mg per SN is required to 
produce the large inferred quantities of dust (40). 
Our current results suggest that it is difficult to 
produce the observed dust mass in such gal- 
axies if only ~10 to 20% of the dust survives 
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the reverse shock. However, because stars in early- 
universe galaxies form in substantially denser 
regions than those in local galaxies [((4) and 
references therein], the dust mass-survival rate 
is probably greater than that which we infer. 
Additionally, if each SNe produced as much dust 
as observed in the ejecta of SN1987A (~0.5 Mo) 
(42, 43), SNe could reasonably account for the 
dust production. These findings are consistent 
with SNe being a dominant dust-production 
mechanism in galaxies of the early universe (7). 
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GALAXY EVOLUTION 


Isolated compact elliptical galaxies: 
Stellar systems that ran away 


Igor Chilingarian””* and Ivan Zolotukhin?® 


Compact elliptical galaxies form a rare class of stellar system (~30 presently known) 
characterized by high stellar densities and small sizes and often harboring metal-rich stars. 
They were thought to form through tidal stripping of massive progenitors, until two 
isolated objects were discovered where massive galaxies performing the stripping could 
not be identified. By mining astronomical survey data, we have now found 195 compact 
elliptical galaxies in all types of environment. They all share similar dynamical and stellar 
population properties. Dynamical analysis for nonisolated galaxies demonstrates the 
feasibility of their ejection from host clusters and groups by three-body encounters, which 
is in agreement with numerical simulations. Hence, isolated compact elliptical and isolated 
quiescent dwarf galaxies are tidally stripped systems that ran away from their hosts. 


alaxies are thought to form through the 

hierarchical merging of smaller building 

blocks into larger systems (J, 2), and the 

history of these interactions is imprinted 

in their observable properties. Some gal- 
axies, such as ultra-compact dwarfs (3) and com- 
pact ellipticals (cEs) (4-7), show evidence of strong 
tidal interactions with massive neighboring gal- 
axies (8) that stripped most of the stars from the 
compact galaxies’ progenitors. cEs are rare gal- 
axies with high stellar densities that resemble cen- 
ters of giant ellipticals but have masses that are 
about two orders of magnitude smaller [M ~ 10° 
solar mass (M.)]. They are found mostly in the 
cores of galaxy clusters next to massive central 
galaxies, which is in alignment with the above 
hypothesis for their evolution. 

The recent discoveries of isolated cE galaxies 
(9, 10) that do not belong to any galaxy cluster or 
group raised another round of debate about cE 
formation: whether they all formed through the 
tidal stripping, or through a different mechanism 
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of formation, such as mergers of dwarf galaxies 
with specific morphologies and configurations 
(10). Dwarf-dwarf galaxy mergers do happen in 
vicinities of massive galaixes (JI, 12). However, 
neither have they been observed in low-density 
environments, nor do any of the remnants re- 
semble properties of cE galaxies. The existence of 
a substantial number of isolated cEs will hence 
imply notably higher dwarf-dwarf merger rates 
than predicted by numerical simulations (13) 
and challenge the currently accepted hierarchi- 
cal structure formation paradigm. 

We demonstrated that all known cE galaxies 
are outliers from the universal optical-ultraviolet 
color-color-magnitude relation of galaxies (14). We 
could therefore perform a search for cE galaxies 
not only in the centers of rich clusters and groups 
as has been done before (5), but across all en- 
vironments using data from wide-field imaging 
surveys, the optical ground-based Sloan Digital 
Sky Survey [SDSS, (75)] and the ultraviolet all- 
sky survey carried out by the GALaxy Evolution 
eXplorer [GALEX, (J6)] spacecraft, which are all 
publicly available in the Virtual Observatory. 

First, we created an initial list of candidates 
(supplementary materials) from the sample of 
galaxies having spectra in the SDSS and, hence, 
known distances by selecting outliers above 
+0.035 mag in the optical (g - 7) color from the 
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universal relation (14). We chose low-luminosity 
galaxies [L < 4 x 10° solar luminosity (Zo) or 
absolute magnitude (M,) > —18.7 mag] that had 
small half-light radii (R, < 0.6 kpc) or were spa- 
tially unresolved in SDSS images; did not show 
substantial ellipticity, which was essential for 
removing edge-on spiral galaxies; had the red- 
shifts in the range of 0.007 < z < 0.08 (distances 
between 30 and 340 Mpc); and either possessed 
red near-ultraviolet colors [VUV — 7) > 4 mag] 
or remained undetected in the NUV band. We 
constrained by color and also removed objects that 
have emission lines in their spectra in order to 
exclude any objects with recent or ongoing star 
formation. 

We then fitted their SDSS spectra against a grid 
of stellar population models using the “NBursts” 
code (17) and obtained mean ages, metallicities, 
and velocity dispersions of their stars. We rejected 
candidates with stellar ages younger than 4 bil- 
lion years and introduced an additional constraint 
based on stellar velocity dispersions (o > 60 km/s). 
Stellar systems in equilibrium that are dynami- 
cally supported by random motions of stars, as 
most elliptical galaxies are, have their dynamical 
masses (M,;,), half-mass radii (R,), and global 
velocity dispersions (o,,) connected by the simple 
virial relation Myr = 9.96 Re ov'/G (18, 19). 
Therefore, for a galaxy with known velocity 
dispersion and a stellar mass (/.) derived from 
its luminosity and stellar population parameters, 
we can estimate the lower limit for the half-light 
radius (if a galaxy contains dark matter, its real 
half-light radius will be larger because M,j;, > 
M:). Hence, we can firmly reject physically 
extended objects such as “normal” dwarf ellipti- 
cal galaxies that are unresolved in SDSS images 
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Fig. 2. Comparison of 
stellar population 
properties of isolated 0.2 
and nonisolated 
compact elliptical 
galaxies to a 
reference sample of 0.0 
elliptical galaxies 
from the SDSS. Mean 
stellar ages and metal- 
licities of cE galaxies 
hosted by groups and 
clusters (round 
symbols) and isolated 
cEs (stars) were -0.4 
determined from the 
modeling of their SDSS 
spectra by using the 
NBursts full-spectrum -0.6 
fitting. The plotting 
symbols are color- 
coded according to the 
r-band absolute magni- 
tudes derived from the 
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Fig. 1. A sample of com- 
pact elliptical galaxies 
in all types of environ- 
ment. Our current sam- 
ple of compact elliptical 
galaxies (blue symbols) 
is compared with a data 
set compiled from the 
literature (red symbols). 
Dots and stars denote 
group/cluster and 
isolated compact ellipti- 

pn cal galaxies, respectively. 
; 0 Square panels in the 
bottom part of the figure 
show representatives of 
the current sample, and 
top corner insets display 
Messier 32 and NGC 
4486, prototypical cEs in 
the local universe, as they 
would look with the SDSS 
telescope at a 130 Mpc 
distance (z = 0.03). Each 
inset panel covers a 20 by 
20 kpc region centered 
ona cE. 


@ group/cluster cEs 
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SDSS photometry. We also computed ages and metallicities for a reference sample of 82,500 elliptical 
galaxies from SDSS DR7 in the same fashion, along with the median r-band absolute magnitude for 
every bin of age-metallicity parameter space. These magnitudes are shown as a background map, with 
the colors on the same scale as cE galaxies. This figure demonstrates that (i) stellar populations of isolated 
and group/cluster cEs do not differ statistically and (ii) cEs are on average much fainter than are normal 


elliptical galaxies of the same stellar age/metallicity. 


because of their large distances by selecting 
only objects with high velocity dispersions. 

In this fashion, we constructed a sample of 195 
galaxies (Fig. 1 and supplementary materials). We 
then cross-matched this list against the SDSS 
Galaxy Groups catalog (20) and established their 


group/cluster membership. For seven objects with- 
out counterparts in the Galaxy Groups catalog, we 
identified possible host galaxies (in most cases, 
group centers) located between 750 kpc and 
3.3 Mpc in projection. Because some bright and 
extended galaxies were missing from the SDSS 
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spectroscopic sample, and therefore also from the 
Galaxy Groups catalog, we used NASA/Infrared 
Processing and Analysis Center (IPAC) Extragalac- 
tic Database (NED) for the identification of host 
galaxies for 45 cEs. Our final sample contains 56 
galaxies in clusters, 128 in groups, and 11 isolated 
or field cE galaxies. Eight galaxies (supplemen- 
tary materials) exhibit prominent tidal streams 
similar to the two known cEs (8). 

Ages and metallicities (Fig. 2 and supplemen- 
tary materials), colors, and luminosities of 11 
isolated cE galaxies do not show a statistically 
significant difference from those of galaxies being 
members of clusters and groups. The Kolmogorov- 
Smirnov (KS) probabilities of these properties for 
isolated and nonisolated subsamples to be derived 
from the same underlying distribution range from 
30 (for luminosities) to 80% (metallicities). Our 
cE sample does not show any correlation between 
the metallicity and the stellar mass, conversely to 
normal elliptical galaxies, which exhibit a rather 
tight mass-metallicity relation (Fig. 2). 

For rich groups and clusters hosting cE gal- 
axies, we built caustic diagrams (27-23) that present 
differences of radial velocities of cluster members 
from the cluster center versus projected distances. 
A galaxy position on such a diagram reflects its 
dynamical status: objects sitting deep inside the 
cluster potential well are located inside the dis- 
tribution, whereas galaxies near the edges are 
barely gravitationally bound to the host cluster 
or infalling onto it for the first time. 

We constructed an ensemble cluster by nor- 
malizing individual cluster and group data by their 
velocity dispersions and sizes for 33 structures 
from our sample, each of which included over 20 
member galaxies (24, 25). Then, we computed its 
caustic diagram in order to visualize the phase 
space pattern of the infalling galaxy population 
and overplotted our cE galaxies on it (Fig. 3). The 
cE population strongly differs from other cluster 
members. The KS tests for projected distance and 
radial velocity distributions reject the hypothesis 
of cE and cluster member samples being derived 
from the same parent population at the 97 and 
98% levels. Numerical simulations of tidal strip- 
ping (5, 26) suggest that a progenitor galaxy, 
even if it approaches a cluster center on a very 
extended radial orbit, will lose a major fraction of 
its orbital energy because of dynamical friction, 
become gravitationally locked in the inner region 
of a cluster on a tightly bound orbit, and will 
finally be accreted by the host galaxy after a few 
billion years. Many cEs from our sample exhibit 
this behavior (Fig. 3 and supplementary mate- 
rials). However, we see a number of cE galaxies 
close to the edges of the caustics, suggesting that 
they are barely gravitationally bound to the cluster 
potential because they do not belong to the infall- 
ing population, as we demonstrated. This looks 
completely unrealistic in the case of a one-to-one 
galaxy encounter resulting in tidal stripping, but 
in the case of a three- or multiple-body encounter, 
this situation becomes significantly more likely. 

An interaction of binary stars with the central 
supermassive black hole is one accepted scenario 
for the creation of hypervelocity stars (27) in our 
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Galaxy: One of the binary components is ejected, 
whereas the other one falls onto the black hole. 
Numerical simulations suggest (28) that three- 
body encounters are responsible for putting Milky 
Way satellites on extreme orbits, going as far as 
3 Mpc away. Even though typical galaxy clusters 
have much wider and deeper potential wells than 
that of the Local Group, three- and even multiple- 
body encounters must happen much more fre- 
quently in those dense environments. Therefore, 
a certain probability exists that some of them will 
lead to the gravitational ejection of galaxies par- 
ticipating in the interaction to extreme radial orbits 
with the apocentric distances of a few megaparsecs 
(29). A three-body encounter that might eject a 
cE galaxy from its host cluster or group does not 
have to happen during the cE formation through 
tidal stripping; that is, two galaxies do not have to 
fall onto the cluster/group center at the same time. 
When acE progenitor is tidally stripped, it quick- 


Fig. 3. Positions of cE 
galaxies identified on 0.0 


ly settles on a tightly bound, rapidly decaying 
orbit (5), and if another galaxy infalls later, but 
before a newly formed cE has been accreted (hun- 
dreds of millions to a couple of billion years), the 
three-body encounter becomes possible. 

We estimate the probability of a close three- 
body encounter geometrically. Numerical simulations 
suggest (30) that over a typical cE lifetime of 2 
billion years (5), an average brightest cluster 
galaxy (BCG) must have experienced three or four 
mergers with massive (M. >~10'° M,) galaxies. 
We assume that (i) a typical cE resides on a qua- 
sicircular orbit within 7; ~ 120 kpc from a host 
BCG galaxy after correction for projection effects 
(fig. S4), (ii) galaxies infall on a BCG on radial 
orbits from random directions, and (iii) a three- 
body encounter will be sufficiently close if a cE 
passes within 73, ~ 20 kpc from a massive infall- 
ing galaxy. Hence, the probability is as a volume 
ratio of a cylinder of radius 73,, height rz, and a 
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Small stars denote cEs located deeply inside the potential wells of these six clusters, and large stars 
(with same colors as caustic lines) indicate cEs that have projected radial velocities of at least 85% 
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circled cEs are examples of galaxies barely gravitationally bound to their host clusters. 
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sphere of radius Tc, Psp = 3/4 (Tp/Ter)” * 0.02, or 
~6 to 8% for three or four merger events. 

In our sample of cluster and group cE galaxies, 
we indeed see numerous examples in which a cE 
resides only 20 to 80 kpc in projection from an 
ongoing major merger scene or several other mas- 
sive cluster members are visible in the cE vicinity 
apart from the massive central cluster/group gal- 
axy. Also, there is a known example of a globular 
cluster in the Virgo cluster (37) that was likely 
ejected at the speed of 2500 km/s and became 
gravitationally unbound to the cluster and its cen- 
tral galaxy, Messier 87. 

We conclude that the tidal stripping process 
can explain all observational manifestations of 
compact elliptical galaxies, including the forma- 
tion of isolated cEs whose existence was suggested 
as a strong counter-argument for tidal stripping 
(9). The ejection of cEs from central regions of 
galaxy clusters by three-body encounters is a chan- 
nel for these galaxies to survive for an extended 
period of time in the violent cluster environment, 
where they would otherwise be accreted by mas- 
sive hosts on a time scale of 2 billion to 3 billion 
years. The 11 isolated cEs probably represent a 
population of runaway galaxies that received suf- 
ficient kick velocities to leave their host clusters 
or groups forever. 

The gravitational ejection mechanism may also 
explain the very existance of extremely rare iso- 
lated quiescent dwarf galaxies (32), where the star 
formation quenching is usually explained by envi- 
ronmental effects. These systems are more spatially 
extended than cEs and do not exhibit substantial 
tidal stripping footprints. This suggests that they 
never came very close to cluster/group centers, 
and therefore, the three-body encounter proba- 
bility for them should be lower than that for cEs, 
although still nonnegligible. 
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A rhodium catalyst for single-step 
styrene production from benzene 


and ethylene 


Benjamin A. Vaughan,’ Michael S. Webster-Gardiner,' Thomas R. Cundari,”* T. Brent Gunnoe™* 


Rising global demand for fossil resources has prompted a renewed interest in catalyst 
technologies that increase the efficiency of conversion of hydrocarbons from petroleum and 
natural gas to higher-value materials. Styrene is currently produced from benzene and 
ethylene through the intermediacy of ethylbenzene, which must be dehydrogenated in a 
separate step. The direct oxidative conversion of benzene and ethylene to styrene could provide 
a more efficient route, but achieving high selectivity and yield for this reaction has been 
challenging. Here, we report that the Rh catalyst (F'DAB)Rh(TFA)(n2—C2H,) [F'DAB is N,N'- 
bis(pentafluorophenyl)-2,3-dimethyl-1,4-diaza-1,3-butadiene; TFA is trifluoroacetate] converts 
benzene, ethylene, and Cu(II) acetate to styrene, Cu(I) acetate, and acetic acid with 100% 
selectivity and yields =95%. Turnover numbers >800 have been demonstrated, with catalyst 


stability up to 96 hours. 


inyl arenes are important precursors for 
fine chemical synthesis, as well as for the 
preparation of plastics and elastomers 
(1-5). For example, styrene is produced 
globally on a scale of ~18.5 million tons (2). 
Current methods for the large-scale production 
of vinyl arenes involve multiple steps, typically 
beginning with arene alkylation using a Friedel- 
Crafts (e.g., AlCl, with HF) or zeolite catalyst 
followed by energy-intensive dehydrogenation 
of the alkyl group (Fig. 1) (-6). Friedel-Crafts ca- 
talysis suffers from the use of harsh acids, includ- 
ing HF, low selectivity for the monoalkylated 
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product (polyalkylation is inherent to the mech- 
anism), and the generation of stoichiometric 
waste (2). Zeolite catalysts have improved the 
process for benzene alkylation, yet these cat- 
alysts still require high temperatures (generally 
350° to 450°C) and give polyalkylated products 
(2, 7-10). 

An alternative method for the production of 
vinyl arenes is a direct and single-step oxidative 
arene vinylation (Fig. 1). If the terminal oxidant 
is oxygen from air (either introduced in situ or 
used to recycle a different in situ oxidant), the net 
reaction is the conversion of benzene, ethylene, 
and oxidant to styrene and water (12). Acid-based 
(i.e., Friedel-Crafts or zeolite catalysts) catalysis 
occurs by electrophilic aromatic substitution and 
does not offer a viable pathway to directly gen- 
erate vinyl arenes. Transition metal complexes 
that catalyze ethylene hydrophenylation by benzene 
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C-H activation followed by ethylene insertion into 
a metal-phenyl bond have been reported as alter- 
natives to acid-based catalysts (Fig. 2) (12-25). 
For these catalysts, B-hydride elimination from a 
M-CH,CH,Ph intermediate and dissociation of 
styrene provides a route for the direct oxidative 
vinylation of benzene (Fig. 2). 

Previously, our groups have studied the use of 
platinum(ID catalysts for the hydrophenylation 
of ethylene to produce ethylbenzene (16-19, 26-28). 
Through a combination of experimental and com- 
putational mechanistic studies, we discerned a 
competing B-hydride elimination pathway from 
Pt-CH,CH,Ph intermediates to form a Pt-styrene 
hydride complex, which can lead to the formation 
of free styrene (28). Unfortunately, the formation 


of styrene leads to catalyst decomposition (27). 
We proposed that this catalyst decomposition is 
the result of unstable Pt(II)-hydride complexes, 
which are formed from f-hydride elimination, 
that react to release Hy and produce metallic Pt. 
The thermodynamic driving force for the forma- 
tion of Pt(s) presents a substantial challenge to 
achieving long-lived vinyl arene production with 
these catalysts (Fig. 2, inset) (JD. Given that the 
formation and decomposition of Pt(II)-H species 
is problematic, we sought to design catalysts using 
isoelectronic Rh(1) in anticipation that Rh(I)- 
H would exhibit greater stability compared with 
related Pt species (Fig. 2, inset) (77). 

Figure 2 shows a targeted catalytic cycle for 
the direct oxidative vinylation of benzene to pro- 
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duce styrene. Despite precedent for the key steps 
in this catalytic cycle, designing a selective cata- 
lyst represents a substantial challenge because 
many competing side reactions (shown in red) 
are likely to have activation barriers that are 
similar to or lower than those of the reactions 
along the desired catalytic cycle. In addition to 
these possible side reactions, designing a molec- 
ular catalyst that achieves high turnover numbers 
(TONS) is difficult because the oxidative condi- 
tions and the presence of potentially reactive 
metal-hydride intermediates could be anticipated 
to result in catalyst decomposition. 

Table 1 compares previously reported homo- 
geneous catalysts for direct oxidative styrene syn- 
thesis from ethylene and benzene (29-34). Generally, 
all suffer from one or more of the following draw- 
backs: low selectivity, low yield, low TON, and/or 
use of oxidants that cannot be regenerated using 
oxygen. Notable results include the work of Hong 
and co-workers, who reported a Rhy(CO);, cata- 
lyst that gave, to our knowledge, the highest TON 
of styrene (472). In tandem with this process, 
liberated dihydrogen is consumed by two equiv- 
alents of ethylene and one equivalent of CO to 
produce 3-pentanone with 809 turnovers (TOs) 
(29). Sanford and co-workers reported that (3,5- 
dichloropyridyl)Pd(OAc), catalyzes styrene pro- 
duction with 100% selectivity and 6.6 TOs for 
styrene (33% overall yield) using PhCO,‘Bu, an 
oxidant that cannot be recycled with oxygen (33). 
Here, we report a rhodium catalyst for the selec- 
tive one-step production of styrene from benzene, 
ethylene, and Cu(II) salts. We chose a Cu(II) salt 
as the in situ oxidant because of industrial prec- 
edent for recycling reduced Cu(I) using oxygen. 
In the commercial Wacker-Hoechst process for 
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Fig. 2. Proposed cycle for transition metal-catalyzed styrene production from benzene and ethylene using CuX2 as an oxidant. The cuprous (CuxX) 
product could be recycled back to the cupric state using air, as shown at the upper left. Potential side reactions that a selective catalyst must avoid are shown 


in red. 
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ethylene oxidation (35, 36), use of oxygen to re- 
oxidize Cu(1) to Cu(II) has proven viable both in 
situ and in a second step (37). 

We recently reported the synthesis of an electron- 
deficient Rh(I) complex ("DAB)Rh(TFA)(1?-COE) 
["DAB is N,N*bis(pentafluorophenyl)-2,3-dimethyl- 
1,4-diaza-1,3-butadiene; TFA is trifluoroacetate; 
COE is cyclooctene] and demonstrated that this 
complex is an active catalyst for arene H/D ex- 
change in trifluoroacetic acid (38). Given that arene 


C-H activation is a key step in transition metal- 
catalyzed oxidative arene vinylation, we hypothe- 
sized that this Rh(1) complex might be an effective 
catalyst precursor for styrene production. Because 
the COE ligand would likely exchange for ethyl- 
ene, the ethylene analog (“DAB)Rh(TFA)(n?-CoH4) 
(1) was independently synthesized as our catalyst 
precursor (Fig. 3). 

Heating a 20-mL benzene solution of 1 [0.001 
mole percent (mol %) relative to benzene] with 


Table 1. Comparison of previously reported catalysts for styrene production. Selectivity is defined 
as the ratio of turnovers of styrene to total turnovers (all products) and is given as a percentage. Yield of 
styrene is reported relative to the limiting reagent. acac, acetylacetonate; DBM, dibenzoylmethane; DCP, 
3,5-dichloropyridine; HPA, H3PMo 20.0 : 30H20; TFA, trifluoroacetate. 
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Fig. 3. Synthesis of (F'IDAB)Rh(TFA)(n2-C2H,) (1) (RT, room temperature). 
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ethylene and Cu(OAc), (120 equivalents relative 
to 1) to 150°C affords 58 to 62 TOs of styrene after 
24 hours (for all TOs reported, two runs were 
performed, and both results are given). Samples 
of the reaction mixture were analyzed by gas 
chromatography-flame ionization detector (GC/ 
FID) using relative peak areas versus an internal 
standard (decane). This corresponds to quantitative 
yield based on the Cu(I]) limiting reagent. The 
calculated yield here assumes that two equiv- 
alents of Cu(II) are consumed to produce two 
equivalents of Cu(I) per equivalent of styrene. No 
other products were observed upon analysis of 
the reaction mixture by GC-mass spectrometry 
or GC/FID, indicating high selectivity for styrene 
production. Detection limits for the instruments 
were equivalent to ~1 TO of product. Specifically, we 
looked for evidence of stilbene, biphenyl, and vinyl 
acetate production, because these are the most 
commonly observed by-products in previously re- 
ported catalysis (Table 1). Control reactions with 
[Rh(u-TFA)(17°-CsH,4)s]o, a precursor to complex 1, 
afforded <5 TOs of styrene after 24 hours, with or 
without Cu(OAc),, highlighting the importance of 
the "DAB ligand. Control reactions with Cu(OAc)2 
alone also afforded no styrene formation. 

With a competent catalyst in hand, we next 
sought to optimize reaction conditions. The ef- 
fect of oxidant identity on catalysis with 1 was 
the first parameter investigated. Both soluble 
{copper 2-ethylhexanoate [Cu(OHex).] and cop- 
per pivalate [Cu(OPiv).]} and insoluble {copper 
acetate [Cu(OAc).] and copper trifluoroacetate 
hydrate [Cu(TFA).]} Cu(II) salts were screened. 
Figure S2 shows a plot of turnovers versus time 
for the various Cu(II) oxidants. Using a turnover 
frequency (TOF) calculated after 4 hours of re- 
action, soluble Cu(OHex), gives the fastest initial 
rate with a TOF of 2.8 x 10° s“, but the reaction 
does not reach 100% yield relative to oxidant 
until 28 hours, which may indicate that catalyst 
deactivation occurs. Cu(OAc), affords a slower 
initial rate than Cu(OHex)s, with a TOF of 2.8 x 
10-* 7 after 4 hours, but this oxidant provides a 
more stable catalytic process. Both Cu(TFA), and 
Cu(OPiv). afford slower initial rates; reactions 
with Cu(OPiv). reach 92% yield after 28 hours, 
whereas reactions with Cu(TFA), produce only 
19 TOs of styrene (32% yield) after 20 hours. 

To study catalyst longevity, we varied the amount 
of Cu(OAc)s. Between 60 and 240 equivalents 
(relative to 1), the yield of styrene relative to 
oxidant was always >95% (fig. S3). These near- 
quantitative yields demonstrate that the catalytic 
process using 1 as a precursor is stable and long- 
lived. For a reaction using 0.0001 mol % 1 and 
2400 equivalents of Cu(OAc)s, the catalyst remained 
active over a period of 96 hours and afforded a 
TON of 817 to 852. A plot of TO versus time shows 
that the Rh catalyst is stable through at least 96 
hours (fig. S1). The tolerance of 1 to a large excess of 
oxidant without any decrease in activity is promis- 
ing. The effect of temperature on catalysis was also 
examined (fig. S4). Generally, the rate of reaction 
increased with temperature; however, at 180°C, 
rapid catalyst deactivation led to a low TON (<10 
TOs) after 12 hours. Minimal activity (<1 TO) was 
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also observed at temperatures <100°C. The opti- 
mal temperature proved to be 150°C. 

We also observed that the reaction rate in- 
creased with increasing ethylene pressure. To de- 
termine the TOF, we measured TO after 4 hours 
of reaction. Figure 4 shows a plot of TOF versus 
ethylene pressure. A linear correlation is observed. 
Thus, the reaction rate appears to have a first- 
order dependence on ethylene concentration. This 
is in contrast to previously reported Pt(II) and 
Rud catalysts for the hydrophenylation of eth- 
ylene, which show an inverse dependence on 
ethylene pressure (14, 17). For the Pt and Ru cat- 
alysts, M(CH,CH.Ph)(n?-C.H,) complexes were 
identified as the likely catalyst resting states. 
The opposite dependence on ethylene pressure 
using 1 as catalyst precursor signals a likely change 
in the catalyst resting state. 

To gain further insight into the reaction mech- 
anism, we ran the reaction in a 1:1 molar mixture 
of CgHg and CgDg¢. After 1 hour, a ky/Xp (ratio of 
rate of reaction of protio-benzene and perdeutero- 
benzene) of 3.1(2) was determined by exam- 
ining the ratio of undeuterated styrene [mass/ 
charge ratio (m/z) = 104] to styrene-d; (m/z = 109) 
in the mass spectra from three independent ex- 
periments (fig. S5). After 2 hours, the observed 
isotope effect was 3.0(2), statistically equivalent 
to the data at 1 hour (fig. S5). Thus, the observed 
ky/kp of ~3.1 likely reflects a kinetic isotope ef- 
fect (KIE) for the catalytic cycle. The KIE is con- 
sistent with other transition metal-mediated C-H 
activation reactions. (39, 40) The primary KIE 
supports the hypothesis that a Rh catalyst is 
facilitating a metal-mediated C-H activation 
process, which occurs before or during the turnover- 
limiting step. No change in the isotopic distribu- 
tion for benzene was observed over the course 
of the reaction, and no styrene-ds.3 products were 
observed except those predicted by the natural 
abundance of deuterium in ethylene. 

Although more detailed studies are required 
to understand the reactivity profile of 1, we be- 
lieve that the highly electron-withdrawing per- 
fluorophenyl] groups on the FID AB ligand help 
suppress irreversible oxidation to inactive Rh(II]) 
in the presence of Cu(ID, possibly facilitate asso- 
ciative ligand exchange between free ethylene 
and coordinated styrene, and facilitate rapid eth- 
ylene insertion into Rh-Ph bonds. Challenges 
that remain for the continued development of 
this class of catalyst include increasing activity 
with the aim of achieving higher conversions 
of benzene. 


REFERENCES AND NOTES 


1. G.A. Olah, A. Molnar, Hydrocarbon Chemistry (Wiley, Hoboken, 
NJ, ed. 2, 2003). 

2. C. Perego, P. Pollesel, in Advances in Nanoporous Materials, 
E. Stefan, Ed. (Elsevier, Oxford, 2010), vol. 1, pp. 97-149. 

3. S.-S. Chen, in Kirk-Othmer Encyclopedia of Chemical 
Technology, A. Seidel, M. Bickford, Eds. (Wiley, Hoboken, NJ, 
2000), pp. 325-357. 

4. H.A. Wittcoff, B. G. Reuben, J. S. Plotkin, in Industrial 
Organic Chemicals (Wiley, Hoboken, NJ, 2004), pp. 100-166. 

5. Process Evaluation/Research Planning (PERP) Program 
Report, Styrene/Ethylbenzene, (PERP Report 91-9, Chem 
Systems, Inc., New York, 1992). 

6. M. Lucchini, A. Galeotti, “Improved process for the 
dehydrogenation of alkyl-aromatic hydrocarbons for the 


424 24 APRIL 2015 * VOL 348 ISSUE 6233 


production of vinyl-aromatic monomers” (International Patent 
W02007073918A1, 2007). 

7. C. Perego, P. Ingallina, Catal. Today 73, 3-22 (2002). 

8. C. Perego, P. Ingallina, Green Chem. 6, 274 (2004). 

9. J. ejka, B. Wichterlova, Catal. Rev. 44, 375-421 (2002). 

0. |. M. Gerzeliev, S. N. Khadzhiey, |. E. Sakharova, Petrol. Chem. 
51, 39-48 (2011). 

1. CRC Handbook of Chemistry and Physics (CRC Press, 
Boca Raton, FL, 1977). 

2. M. Lail, B. N. Arrowood, T. B. Gunnoe, J. Am. Chem. Soc. 125, 
7506-7507 (2003). 

3. M. Lail et al., Organometallics 23, 5007-5020 (2004). 

4. N. A. Foley, J. P. Lee, Z. Ke, T. B. Gunnoe, T. R. Cundari, 
Acc. Chem. Res. 42, 585-597 (2009). 

5. E. E. Joslin et al., Organometallics 31, 6851-6860 (2012). 

6. J.R. Andreatta, B. A. McKeown, T. B. Gunnoe, J. Organomet. Chem. 
696, 305-315 (2011). 

7. B.A. McKeown et al., J. Am. Chem. Soc. 133, 19131-19152 (2011). 

8. B. A. McKeown, H. E. Gonzalez, T. B. Gunnoe, T. R. Cundari, 
M. Sabat, ACS Catal. 3, 1165-1171 (2013). 

9. B. A. McKeown, B. M. Prince, Z. Ramiro, T. B. Gunnoe, 
T. R. Cundari, ACS Catal. 4, 1607-1615 (2014). 

20. S. A. Burgess et al., Chem. Sci. 5, 4355-4366 (2014). 

21. W. D. Jones, J. A. Maguire, G. P. Rosini, Inorg. Chim. Acta 270, 
77-86 (1998). 

22. A. T. Luedtke, K. |. Goldberg, Angew. Chem. Int. Ed. 47, 
7694-7696 (2008). 

23. T. Matsumoto, D. J. Taube, R. A. Periana, H. Taube, H. Yoshida, 
J. Am. Chem. Soc. 122, 7414-7415 (2000). 

24. T. Matsumoto, R. A. Periana, D. J. Taube, H. Yoshida, 
J. Mol. Catal. A 180, 1-18 (2002). 

25. J. Oxgaard, R. P. Muller, W. A. Goddard 3rd, R. A. Periana, 
J. Am. Chem. Soc. 126, 352-363 (2004). 

26. B. A. McKeown, N. A. Foley, J. P. Lee, T. B. Gunnoe, 
Organometallics 27, 4031-4033 (2008). 

27. B. A. McKeown et al., Organometallics 32, 3903-3913 (2013). 

28. B. A. McKeown et al., Organometallics 32, 2857-2865 (2013). 

29. P. Hong, H. Yamazaki, J. Mol. Catal. 26, 297-311 (1984). 

30. D. Taube, R. Periana, T. Matsumoto, “Oxidative coupling 
of olefins and aromatics using a rhodium catalyst and a 
copper(II) redox agent” (U.S. Patent 6127590A, 2000). 


3b. 


= 


Y. Fujiwara, |. Noritani, S. Danno, R. Asano, S. Teranishi, 

J. Am. Chem. Soc. 91, 7166-7169 (1969). 

32. T. Yamada, A. Sakakura, S. Sakaguchi, Y. Obora, Y. Ishii, 

New J. Chem. 32, 738 (2008). 

33. A. Kubota, M. H. Emmert, M. S. Sanford, Org. Lett. 14, 
1760-1763 (2012). 

34. K. Sasaki, T. Sakakura, Y. Tokunaga, K. Wada, M. Tanaka, 
Chem. Lett. 17, 685-688 (1988). 

35. American Chemical Society, Chem. Eng. News Archive 39 (16), 
52-55 (1961). 

36. W. A. Herrmann, in Catalysis from A to Z, B. Cornils, W. A. Herrmann, 
M. Muhler, C.-H. Wong, Eds. (Wiley, Weinheim, Germany, 2007), 
pp. 1512-1524. 

37. M. Eckert, G. Fleischmann, R. Jira, H. M. Bolt, K. Golka, in 
Ullmann’s Encyclopedia of Industrial Chemistry (Wiley-VCH 
Verlag, Weinheim, Germany, 2000), pp. 1-17. 

38. M. S. Webster-Gardiner et al., Cat. Sci. Tech. 5, 96-100 (2015). 

39. M. Gomez-Gallego, M. A. Sierra, Chem. Rev. 111, 4857-4963 (2011). 

40. W. D. Jones, Acc. Chem. Res. 36, 140-146 (2003). 


ACKNOWLEDGMENTS 


The authors acknowledge support from the U.S. Department of 
Energy, Office of Basic Energy Sciences [DE-SCO000776 (T.B.G.) 
and DE-FG02-03ER15387 (T.R.C.)] for studies of styrene catalysis; 
the Center for Catalytic Hydrocarbon Functionalization, an Energy 
Frontier Research Center (award DE-SCO001298), which funded 
the initial catalyst discovery; and an AES Corporation Graduate 
Fellowship in Energy Research (M.S.W.-G). The authors also thank 
B. McKeown, G. Fortman, S. Kalman (University of Virginia), and 
R. Nielsen (California Institute of Technology) for helpful discussions. 


SUPPLEMENTARY MATERIALS 
www.sciencemag.org/content/348/6233/421/supp|/DC1 
Materials and Methods 

Figs. Sl to S4 

Table S1 

References (41-43) 


3 November 2014; accepted 12 March 2015 
10.1126/science.aaa2260 


SELF-ASSEMBLY 


Selective assemblies of giant 
tetrahedra via precisely controlled 
positional interactions 


Mingjun Huang,’ Chih-Hao Hsu,' Jing Wang,' Shan Mei,' Xuehui Dong,’ Yiwen Li,’ 
Mingxuan Li,’ Hao Liu,’ Wei Zhang,’ Takuzo Aida,” Wen-Bin Zhang,°?* 


Kan Yue,'* Stephen Z. D. Cheng'* 


Self-assembly of rigid building blocks with explicit shape and symmetry is substantially 
influenced by the geometric factors and remains largely unexplored. We report the selective 
assembly behaviors of a class of precisely defined, nanosized giant tetrahedra constructed by 
placing different polyhedral oligomeric silsesquioxane (POSS) molecular nanoparticles at the 
vertices of a rigid tetrahedral framework. Designed symmetry breaking of these giant 
tetrahedra introduces precise positional interactions and results in diverse selectively 
assembled, highly ordered supramolecular lattices including a Frank-Kasper A15 phase, which 
resembles the essential structural features of certain metal alloys but at a larger length scale. 
These results demonstrate the power of persistent molecular geometry with balanced 
enthalpy and entropy in creating thermodynamically stable supramolecular lattices with 
properties distinct from those of other self-assembling soft materials. 


elf-assembled hierarchical structures in 
soft materials have been intensely studied. 
Among them, assemblies of building blocks 
with specific geometric shapes and sym- 
metry are of particular interest. As the sim- 


plest case, ordered structures constructed from 
packing of spherical motifs have been a classic yet 
dynamic research field that can be traced back to 
the study of metals and metal alloys. Most metal 
atoms, viewed as congruent spheres, typically tend 
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to hold 12 neighbors (the coordination number, 
CN, is thus 12) in local environments, forming 
the most efficient packing scheme with tetrahedral 
interstices (1). This type of structure allows three 
possible variations: face-centered cubic (the cub- 
octahedron), hexagonal close-packed (the twinned 
cuboctahedron), and the topologically close-packed 
icosahedron (or “icosahedral coordination”). 

In metal alloys, different metal atoms with 
various sizes and electronic states are involved. 
Frank and Kasper (2) studied the stability of 
icosahedral lattices and proved that distorted 
icosahedra could be accommodated with topo- 
logically close-packed Kasper polyhedra, which 
allow even higher coordination numbers (CN = 
14, 15, and 16) in metal alloy crystals. This class 
of metal alloy crystal structures is referred as the 
“Frank-Kasper” phases, including the Al5 phase 
(with an A3B stoichiometry such as Cr3Si), the 
Friauf-Laves phase (with an A,B stoichiometry 
such as Zn,Mg), the o phase (with an AB stoi- 
chiometry such as CrFe), and others (3). Some 
Frank-Kasper phases are viewed as periodic ap- 
proximates of aperiodic “quasicrystals.” There- 
fore, they provide a platform to understand how 
to fill in space with different spherical motifs and 
how to achieve properties related to their char- 
acteristic structural features of low lattice sym- 
metry and high coordination numbers. 

A typical cubic unit cell of the A15 phase 
(Fig. 1A) consists of six.A units (pale red spheres) 
in 14-fold Kasper polyhedra and two B units 
(dark red spheres) in 12-fold icosahedral coordi- 
nation (Fig. 1B) with a space group of Pm3n (O}). 
The projection view along the (001) direction 
(Fig. 1C) displays a regular two-dimensional (2D) 
4* tiling pattern (4). Recently, examples of the 
A15, o, and quasicrystalline phases constructed by 
nano- and micrometer-sized “deformable” spheres, 
micelles, and colloids were reported in many 
systems, including spherical dendrimers (5-8), 
ABC star-triblock copolymers (9), micelles of lin- 
ear diblock or tetrablock copolymers in the bulk 
(10-12) or in solution (73), binary nanoparticle 
lattices (J4), and mesoporous silica produced 
from surfactant micelles (15). In particular, for- 
mation of the Al5 phase in dendrimers has been 
attributed to the presence of soft “squishy surface 
layers” composed of alkyl chains, which can de- 
form to minimize steric interactions (5, 16) and 
surface contact area among the spheres (the 
Weaire-Phelan structure) (17, 18). 

Constructing ordered phases with the use of 
shaped building blocks other than spheres has 
yet to be demonstrated. Recent computer simu- 
lation results revealed possible crystalline and 
liquid crystalline structures from the packing 
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of polyhedra (79). Among all the polyhedra, the 
tetrahedron is the simplest. Rigid tetrahedron 
building blocks have been shown to form quasi- 
crystalline and crystalline phases with high pack- 
ing fractions (20). However, related experimental 
investigation remains largely unexplored in terms 
of both observations of ordered structures and 
their formation mechanisms. Shape-persistent 
molecular nanoparticles, such as derivatives of 
POSS (21), fullerenes (22), polyoxometalates (23), 
and proteins (24), offer great opportunities to 


construct nanosized giant tetrahedra with atomic 
precision (25, 26). 

Here, we present an experimental study of 
giant tetrahedra constructed by attaching four 
POSS cages with different functional groups to 
a rigid tetrahedral core (Fig. 2). They are distin- 
guished from the reported dendrimer and block 
copolymer systems (72) by the absence of any 
flexible alkyl or polymeric chains. Self-assembly 
of these giant tetrahedra is mediated by inter- 
actions among the POSS nanoclusters and the 
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Fig. 1. Schematic illustration of the A15 phase. (A) In an A15 cubic unit cell, the dark red and pale red 
colors represent different coordination environments. (B) Schemes of CN = 12 and CN = 14 coordination 
environments in the A15 lattice. (©) 2D-projected view of the A15 lattice along the (001) direction. The 
inset shows a 2D 4° tiling pattern along the z axis. The spheres at the sparse layers (z/4 and 3z/4) are 
represented by gray circles; the spheres at the dense layers are shown by black and white circles (z/2 


and z). 


R = -CH,COOH 3a 
-CH,CH,COOH 3b 


R = -CH,COOH 2a 

-CH,CH,COOH 2b 
-CH,CH(OH)CH,(OH) 2c 
-CH,CH,OH 2d 


R = -CH,COOH 4a 
-CH,CH,COOH 4b 


& R: Hydrophilic functional group 


Fig. 2. Chemical structures and molecular models (shown in shadow) of the four categories of 
giant tetrahedra. Cartoons in the boxes are corresponding simplifications of the giant tetrahedra, in which 
blue spheres represent hydrophilic POSS cages and red spheres represent hydrophobic BPOSS cages. 
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overall molecular symmetry. Various ordered su- 
pramolecular lattices, including the Frank-Kasper 
Al5 phase, are observed in this system by tuning 
the numbers of hydrophilic or hydrophobic POSS 
cages in each molecule and the functional groups 
on the hydrophilic POSS cages. 

Giant tetrahedra 1 to 4 with different parti- 
tions of hydrophobic and hydrophilic POSS cages 
were synthesized by sequentially applying two 
“click” reactions (fig. S1): the copper-catalyzed 
azide-alkyne [3+2] cycloaddition reaction and 
the thiol-ene reaction (27, 28). The hydrophobic 
POSS cages have seven isobutyl groups (BPOSS) 
and the hydrophilic POSS cages have either hydroxyl 
or carboxylic acid groups (Fig. 2). Incorporation 
of different POSS cages results in competing in- 
teractions (i.e., collective hydrogen-bonding inter- 
actions among the hydrophilic POSS cages and 
the crystallization of BPOSS cages) to drive self- 
assembly; tuning the number of hydrophobic or 
hydrophilic POSS cages systematically varies mo- 
lecular symmetry of the giant tetrahedra. Nu- 
clear magnetic resonance and mass spectroscopy 
results (figs. S2 and S3) confirmed their struc- 
tural precision and high purity. We expect that 
geometric and interactional factors jointly deter- 
mine their self-assembly behaviors. 

Giant tetrahedron 1 contains four identical 
BPOSS cages. A crystalline structure with a tri- 
clinic unit cell and a space group of PI has been 
determined (fig. S4 and table S1), based on the 
combination of selected-area electron diffraction 
(SAED; fig. S4A) data from its single crystals and 
wide-angle x-ray diffraction (WAXD; fig. S4B) 
data from the bulk sample (28). In the simulated 
molecular packing, the tetrahedral cores adopt 
an interpenetrated stacking manner to form geo- 
metrically locked columns, which are surrounded 
by a shell of crystalline BPOSS cages (fig. S4, D and 
E). To maximize the contacts among the crystal- 
line BPOSS cages, the lattice is distorted from hex- 
agonally packed cylinders toward lower symmetry. 

Replacing one BPOSS cage with a hydrophilic 
POSS cage in 1 lowers the molecular symmetry to 
C3v and results in giant tetrahedra 2a to 2e. At 
25°C, density-frustrated lamellar supramolecular 
structures with a three-layer packing periodicity 
are observed in 2a to 2c, as supported by the 
combined small-angle x-ray scattering (SAXS) 
and WAXD results (Fig. 3A) with a scattering 
vector (q) value ratio of 1:2:3. Besides, the strongest 
diffraction peak at 1.09 nm in the WAXD pattern 
is attributed to the characteristic diffraction of crys- 
talline BPOSS domains (29). A bright-field (BF) 
transmission electron microscope (TEM) image 
of microtomed thin-sectioned 2a samples (Fig. 3B) 
and its fast Fourier transform (FFT) pattern (Fig. 
3B, inset) also confirm the lamellar structure. The 
measured periodicities of 4.3 to 4.7 nm (Table 1) 
can only accommodate two layers of BPOSS and 
one interdigitated layer of the hydrophilic POSS 
cages (Fig. 3G) (estimated ~4.5 nm). Despite the 
unmatched numbers of hydrophobic and hydro- 
philic POSS cages, crystallization of BPOSS cages 
dominates and preferentially creates flat inter- 
faces (30), leading to the formation of frustrated 
supramolecular lamellae. 
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Fig. 3. Selectively assembled structures of 2a. (A) Combined SAXS and WAXD profiles of 2a 
evaporated from tetrahydrofuran-acetonitrile (THF/MeCN) mixed solvents at 25°C. (B) BF TEM image and 
corresponding FFT pattern (inset) of a microtomed thin-sectioned 2a sample. (C) SAXS pattern of 2a after 
the sample was heated to above its 7,, and annealed at 140°C for 12 hours. (D) A {100} plane of an Al5 
supramolecular lattice was identified by the BF TEM image after the thin-sectioned sample was stained by 
RuO,. The inset is the FFT pattern of this image. (E) Fourier filtering of the image shown in (D) revealed a 
clear view of the 2D 4% tiling along the (100) direction. (F) Inverse colored and magnified image of (E). White 
spheres represent the hydrophilic POSS domains with different sizes. The inset shows a simulated 
projection view along the (100) direction. Spheres in the red-dot circles correspond to the dark red ones 
shown in Fig. 1A. (G@) Schematic illustrations of the selective assembly mechanism and molecular packing 
in the A15 lattice. 
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Table 1. Supramolecular lattice analysis of the giant tetrahedra with different symmetry. Lattice | 
structures were formed by slow evaporation of the sample solutions in THF/MeCN mixed solvents at 25°C; 
lattice Il structures were formed after annealing treatment. d, is the determined periodicity of the lamellar 
structures. 


Molecule Lattice | Tm (°C) = d, (nm) Lattice Il dj, (nm) M 

1 Triclinic crystal 138 _ Disordered = _ 
2a Lamellae 145 4.50 A15 13.2* 38/508 
2b Lamellae 144 4.60 Al5 14.2* 46/618 
2c Lamellae 146 4.67 Al5 14.6* 49/658 
2d Lamellae 126 4.50 bec 8.2* 448 
3a Lamellae 177 5.40 Double gyroid 6.35 = 
3b Lamellae 172 5.90 Lamellae 6.40t _ 
3c Lamellae 180 6.00 Lamellae 7307 - 
4a Disordered - — Cylinder 6.22% 6.7# 
4b Disordered - - Cylinder 6.28% 6.6% 
Ac Disordered — — Cylinder 6.70* 74# 


*Dimensions of the A15 or bcc unit cells. tLamellar periodicities in supramolecular lattice Il. {The corresponding 
principal (01) spacing between the cylinders in the honeycomb-like hexagonal lattices. §Calculated numbers of 
giant tetrahedra in two types of spheres in the A15 lattice (the size ratios of two types of spheres in 2b and 2c are 
assumed to be 1.1, the same as in 2a) or in each sphere in the bcc lattice. #Average number of giant tetrahedra 
within 1-nm-thick cross section of the cylinders in the honeycomb-like hexagonal supramolecular lattices. 
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After 2a was heated to 180°C (above its melting 
point 7,,; Table 1) and immediately cooled to and 
annealed at 140°C for 12 hours, an entirely 
different SAXS pattern was observed (Fig. 3C). 
The WAXD pattern indicates that BPOSS cages 
were amorphous in this structure (fig. S5A). Both 
2b and 2c exhibited virtually identical SAXS 
patterns upon the same thermal treatment (fig. 
S5, B and C). The observed g value ratios (Fig. 2C 
and fig. S5, B and C) are \/2:\/4:\/5:\/6, which is 
characteristic of the A1l5 phase (5). A cubic unit 
cell with a = 13.2 nm can be deduced for 2a. The 
lattice assignment is further validated by TEM 
images of the microtomed, RuO,-stained thin- 
sectioned samples of 2a (~80 nm thick). The BF 
TEM image in Fig. 3D exhibits the arrangement 
of spheres along the (100) direction of the A15 
phase in real space. Its FFT pattern is shown in 
the inset of Fig. 3D (also in fig. S5D) with major 
diffractions assigned. Fourier filtering treatment 
provides a clear view of the regular 2D 4* tiling 
pattern along the (100) direction (Fig. 3E). From 
this image, the measured distance between two 
closest neighboring squares is 13.2 nm, which is 
consistent with the value calculated from the 
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SAXS result. Setting Fig. 3E in inverse contrast 
makes it easier to identify the fine features of the 
spherical packing (Fig. 3F). It is observed that 
spheres in the red-dot circles (Fig. 3F), which cor- 
respond to the dark red spheres in Fig. 1A with 
CN = 12, are smaller relative to their neighbors 
(pale red spheres in Fig. 1A with CN = 14). On the 
basis of the average size ratio between these two 
types of spheres (1.1 + 0.06), we estimate that 
these two types of spheres contain 38 and 50 
giant tetrahedra, respectively (28). These results 
support the existence of two types of spheres with 
different coordination environments in a single- 
component system, in contrast to metal alloys 
with different types of atoms. Moreover, the 
number of giant tetrahedra in each sphere is 
found to increase with increasing strength of the 
collective hydrogen-bonding interactions and 
the molecular masses from 2a to 2e (Table 1). 
The formation mechanism of the A15 phase is il- 
lustrated in Fig. 3G. When the frustrated lamellar 
crystals melt, the hydrophilic POSS cages form 
spherical aggregates via collective hydrogen bond- 
ing, while BPOSS cages originally located in the 
neighboring top and bottom lamellar layers un- 
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Fig. 4. Selectively assembled structures from giant tetrahedra 3 and 4. (A to C) SAXS patterns of 
3c (A), 3a (B), and 4b (C) were taken at 25°C after corresponding thermal treatments. (D) BF TEM image 
of thin-sectioned 3¢ confirms the lamellar lattice deduced from the SAXS result shown in (A). (E) BF TEM 
image of thin-sectioned and RuO,-stained 3a confirms the double-gyroid lattice deduced from the SAXS 
result shown in (B). (F) BF TEM image of thin-sectioned 4b confirms the honeycomb-like hexagonal lattice 
deduced from the SAXS result shown in (C). In (D) to (F), the insets are the FFT patterns of the TEM 
images. (G) Schematic illustration of the selective assembly mechanisms and packing models of 3a to 3c. 


(H) Schematic packing models of 4a to 4c. 
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dergo a 2D scrolling to form the shell. The self- 
assembled spheres finally pack into the A15 
supramolecular lattice. 

Dendrimers with a poly(benzyl ether) core and 
a dodecyl corona are known to form spheres that 
further pack into A15 lattices (5, 7), which can be 
explained by the soft “squishy surface layers” 
that promote deformation of the spheres to maxi- 
mize entropy and minimize interfaces (17, 18, 31, 32). 
The molecular geometry of giant tetrahedra 2a 
to 2e also prefers the formation of spheres in 
the first step. Without any flexible chains, it is 
proposed that extra degrees of freedom (such as 
the excluded volume of BPOSS cages and the 
nonclose packing of the hydrophilic POSS cages 
via hydrogen bonding) contribute to the size dif- 
ferentiations of the assembled spherical motifs, 
which entropically favor more space and looser 
packing to form the A15 phase. Furthermore, it is 
believed that the deformability is associated with 
the size of the spheres, because the interstitial 
gaps become larger as the size of spheres in- 
creases (16). 

To prove this assumption, we synthesized 2d 
(Fig. 2 and fig. S6A) containing a hydrophilic 
POSS cage with the weakest hydrogen-bonding 
interaction and the smallest molar mass among 
2a to 2d. After similar thermal treatment, a body- 
centered cubic (bcc) lattice composed of only one 
type of spheres was found (fig. S6). Each sphere 
contains 44 giant tetrahedra 2d. This number 
provides a reasonable estimation of the upper 
size limit of nondeformable spheres assembled 
from this series of giant tetrahedra, because a 
small fraction of the Al15 phase can also be iden- 
tified from the TEM image of thin-sectioned 2d 
samples (fig. S6F). Any spheres larger than this 
size would deform as the result of nonclose pack- 
ing of the hydrophilic POSS cages at the spherical 
center (33) to better fit into the supramolecular 
lattice-packing requirements with lower symmetry. 

Giant tetrahedra 3a to 3c are more symmetric 
in terms of both volume fractions and interac- 
tions. “Double-layered” lamellar supramolecular 
lattices (30, 34) are observed for 3a to 3e at 25°C, 
due to the dominating BPOSS crystallization (fig. 
S7A). Layer thicknesses of these lamellar struc- 
tures were determined from SAXS results (Table 
1 and fig. S7B), and they match the estimated 
values from molecular packing models (28). Their 
high-temperature structures were obtained by 
annealing above their 7,, at 180°C for 3 hours 
and subsequent quenching into liquid nitrogen 
to suppress crystallization of BPOSS cages (fig. 
S7C). SAXS and TEM results (Fig. 4, A and D) 
indicate that the lamellar structures of 3b and 3c 
remain but have increased lamellae d-spacings 
relative to their room-temperature structures; 
this is mainly attributed to the disordered BPOSS 
packing and thermal expansion. On the other 
hand, a highly ordered double-gyroid supra- 
molecular lattice (space group Ja3d) forms in 
8a after similar treatment (Fig. 4, B and E, and 
fig. S7D). In the TEM image (Fig. 4E), the darker 
regions are hydrophilic POSS domains embedded 
in the hydrophobic matrix composed of BPOSS 
cages and the tetrahedral cores. Formation of 
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such a double-gyroid phase from the rigid and 
symmetric giant tetrahedron 3a reflects the ubiq- 
uity of the gyroid structure, implying the subtle 
influence of the slightly different volume frac- 
tions and interactions on the selective assembly 
of these giant tetrahedra (Fig. 4G). 

Giant tetrahedra 4.a to 4c failed to crystallize 
in similar solvent evaporation processes because 
of the low volume fraction of BPOSS cages that 
does not favor the formation of continuous 2D 
flat crystals (fig. S8). At such a volume fraction, 
an inverse spherical phase such as bec or A15 was 
expected. However, after thermal annealing at 
130°C, only ordered hexagonal cylinder phases 
were observed in 41a to 4e, as revealed by the q 
value ratio of LV3:V/4 in their SAXS patterns 
(fig. S8B, Fig. 4C, and fig. S8C for 4a, 4b, and 4c, 
respectively) and the honeycomb-like hexagonal 
structure observed in BF TEM images (Fig. 4F). 
In the proposed schematic packing model of 4a 
to 4c (Fig. 4H), BPOSS cages are wrapped into 
centers of the columns while hydrophilic POSS 
cages with strong collective hydrogen bonding 
form the continuous matrix. In sharp contrast to 
the packing of 2a to 2c at higher temperatures, 
4a to 4c tend to maximize the contacts of hy- 
drophilic POSS cages (and thus the extent of 
collective hydrogen-bonding formation), which 
substantially minimizes the overall free energy of 
the system. 

Symmetry breaking on accurately controlled 
positional interactions of nanosized giant tetrahe- 
dra has been used to construct the Frank-Kasper 
Al5 phase and other ordered supramolecular 
lattices. The diverse self-assembly behaviors of 
these giant tetrahedra reveal that rigid, single- 
component soft-matter systems offer potential 
for building supramolecular “metal alloy analogs.” 
The subtle competition between the persistent 
molecular geometry and the deformability driven 
by interaction terms dictates the selective assem- 
bly of the giant tetrahedra. Because of the “click” 
synthesis, this system is highly tunable in terms 
of core structure, nanoparticle functionality, and 
feature size. The concepts and formation mecha- 
nisms of these supramolecular structures could 
be extended to other giant-polyhedra molecules 
with different topologies and chemical compositions. 
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METHANE CYCLING 


Nonequilibrium clumped isotope 
signals in microbial methane 
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Lucy C. Stewart,” James F. Holden,” Alexander N. Hristov,® John W. Pohlman,’ 

Penny L. Morrill,® Martin Kénneke,* Kyle B. Delwiche,? Eoghan P. Reeves,’ 
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Methane is a key component in the global carbon cycle, with a wide range of anthropogenic 
and natural sources. Although isotopic compositions of methane have traditionally aided 
source identification, the abundance of its multiply substituted “clumped” isotopologues 
(for example, 3CH3D) has recently emerged as a proxy for determining methane-formation 
temperatures. However, the effect of biological processes on methane’s clumped 
isotopologue signature is poorly constrained. We show that methanogenesis proceeding 
at relatively high rates in cattle, surface environments, and laboratory cultures exerts kinetic 
control on 72CH3D abundances and results in anomalously elevated formation-temperature 
estimates. We demonstrate quantitatively that Hz availability accounts for this effect. Clumped 
methane thermometry can therefore provide constraints on the generation of methane in 
diverse settings, including continental serpentinization sites and ancient, deep groundwaters. 


arbon (2C/"C) and hydrogen (D/H) iso- 
tope ratios of methane are widely applied 
for distinguishing microbial from thermo- 
genic methane in the environment (J-7), as 
well as for apportioning pathways of mi- 
crobial methane production (8-10). This bulk 
isotope approach, however, is largely based on 
empirical observations, and different origins of 
methane often yield overlapping characteristic 
isotope signals (3, 7, 17-13). Beyond conventional 
bulk isotope ratios, it has become possible to pre- 
cisely measure the abundance of multiply sub- 
stituted “clumped” isotopologues (e.g., "CH3D) 
(14, 15). In particular, the abundance of clumped 
isotopes makes it possible to obtain information 


about the temperature at which C-H bonds were 
formed or last equilibrated (14) (fig. S1). Formation 
temperatures of both thermogenic and microbial 
methane in natural gas reservoirs can be estimated 
on the basis of clumped isotopologues (16). The 
mechanisms by which isotopologues attain distri- 
butions consistent with thermodynamic equi- 
librium, however, remain unclear because bulk 
methane isotopes (8'°C and SD) often reflect 
kinetic isotope fractionations (13, 17), and H 
isotope exchange between methane and water 
is sluggish (18). 

To test whether clumped methane thermom- 
etry can be widely applied for methane sources 
beyond natural gas reservoirs, we examined 
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methane samples from diverse systems, includ- 
ing lakes, wetlands, cow rumen, laboratory cul- 
tures of methanogenic microbes, and geological 
settings that may support abiogenic methane 
production. We used a recently developed tu- 
nable laser spectroscopy technique (14, 19) to 
measure the relative abundances of four meth- 
ane isotopologues (?7CH,, CH,, CH3D, and 
13CH.D). 

Our measurements for dominantly thermo- 
genic gases from the Marcellus and Utica shales 
CZ, 20) yielded A™CH,D-based temperatures of 
1477*35 °C and 160732°C, respectively. The clumped 
isotope temperature for the Marcellus Shale sam- 
ple is comparable to, although slightly lower than, 
estimates by Stolper et al. (16) of 179° to 207°C 
(Fig. 1). In addition, microbial methane in pore 
waters and gas hydrates from northern Cascadia 
margin sediments (3) and from wells producing 
from coal seams in the Powder River Basin (2, 27) 
yielded A“CH,D temperatures of 12° to 42°C and 
35° to 52°C, respectively. These are consistent 
with their expected low formation temperatures. 
Furthermore, thermogenic methane sampled from 
a hydrothermal vent in the Guaymas Basin, Gulf 
of California (6), yielded a A®CH,D temperature 
of 32615/°°C, within error of the measured vent 
temperature (299°C) (22). Therefore, our data 
provide independent support of the hypothesis 
that “CH;D abundance reflects the temperature 
at which methane is generated in these sedimen- 
tary basins (16). 

In contrast, we found that methane sampled 
from lakes, a swamp, and the rumen of a cow 
carries “CH3D signals that correspond to anom- 
alously high A°CH.D temperatures (139° to 775°C) 
(Fig. 1A) that are well above the environmental 
temperatures (<40°C). Such signals are clearly not 
controlled by equilibrium. Notably, a positive cor- 
relation between A“CH,D and the extent of D/H 
fractionation between methane and environmen- 
tal water [€methane/water (23) (Fig. 2)] suggests a 
strong link between isotopologue (ie., %“CH3D) 
and isotope (D/H) disequilibria. In contrast, the 
above-mentioned methane samples from sedi- 
mentary basins appear to have attained hydro- 
gen isotope equilibrium with associated waters 
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at or near the temperatures indicated by the 
A“CH.D data (Fig. 2). 

To confirm these observations from the nat- 
ural environment, we demonstrated that strong 
disequilibrium “CHD signals are also produced 
by cultures of methanogenic archaea in the lab- 
oratory (Fig. 3). Thermophilic methanogens cul- 
tured at 40° to 85°C produced methane with 
A®CH3D values from +0.5 to +2.3 per mil (%o) 
(corresponding to A’CH3D temperatures of 216° 
to 620°C), and mesophilic methanogens cultured 
at ambient temperature produced methane with 
conspicuously “anticlumped” signatures (i.e., val- 
ues of A’CH3D <0%o, for which no apparent 
temperature can be expressed) as low as -1.3%o 
(Fig. 3). Methane from cultures is also charac- 
terized by large kinetic D/H fractionation with 
respect to water (17, 24). Because laboratory cul- 


tures are grown under optimal conditions (high 
H, and high CO,), these anticlumped A’CH;D 
and low €methane/water Values are primarily ex- 
pressions of kinetic isotope effects. Consequently, 
the distribution of samples with A“CH;D and 
€methane/water Values in Fig. 2 can be explained 
by microbial methanogenesis operating on a spec- 
trum between fully kinetic (ow A“CH;D and low 
Emethane/water) and equilibrium (high ACH 3D 
and high €methane/water) end members. 

We constructed a mathematical framework to 
describe the controls on the correlation of A’CH;D 
and €methane/water Signals from hydrogenotrophic 
methanogenesis. The model largely follows those 
developed for microbial sulfate reduction (25, 26) 
and predicts the isotopologue compositions of 
product methane as a result of a series of en- 
zymatic reactions (fig. S4) (19). Using isotope 
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intervals (table S1). Data from (16) were scaled to their corresponding ACHsD values (15). The shaded 
area represents the temperature range within which microbial life has been demonstrated to date (35). 
The dotted line represents ASCH3D = 0%o (temperature T + «); data plotting below this line cannot 
yield corresponding apparent temperatures. (B) 58°C plotted against 8D, showing characteristic fields for 


different methane sources from (13). 
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fractionation factors estimated from theory, 
experiments, and observations as input parame- 
ters (table S3) (19), our model reproduces the 
observed correlation between A”CH;D and 
Emethane/water Of natural samples (Fig. 2). The 
isotopologue compositions of product methane 
reflect the degree of metabolic reversibility. Fully 
reversible reactions yield equilibrium end mem- 
bers (27), whereas irreversible reactions result in 
kinetic (disequilibrium) end-member signals. In 
this model, the reversibility is linked to available 
free energy (26, 27), in this case expressed as H, 
concentration ([H.]). The model can explain the 
relationship among [Hg], €methane/water (28), and 
A“CH3D via Michaelis-Menten kinetics and can 
predict the observed patterns in diverse set- 
tings, ranging from marine sediments (low 
[Ho], high A CH3D and Emethane/water) to 
bovine rumen (high [Hs], low AWCH3D and 
Emethane/water) (Fig. 4). We note that mixing of 
methane sources with different 5!C and 5D 
values or oxidation of methane could also alter 
the relationships over the primary signal of mi- 
crobial methanogenesis (19). Likewise, inheritance 
of clumping signals from precursor organic sub- 
strates (e.g., via acetoclastic or methylotrophic 
methanogenesis) cannot be ruled out entirely and 
awaits experimental validation. 

We showed above that the combination of 
A” CH3D and €methane/water Values provides mech- 
anistic constraints on whether methane was 
formed under kinetic versus near-equilibrium 
conditions. Next, we used this framework to place 
constraints on the origins of methane at two 
sites of present-day serpentinization in Phaner- 
ozoic ophiolites [The Cedars (29) and Coast Range 
Ophiolite Microbial Observatory (CROMO) (30)] 
in northern California, as well as in deep (>2 km 
below surface) fracture fluids with billion-year res- 
idence times in the Kidd Creek mine, Canada (5, 37). 

Methane collected from groundwater springs 
associated with serpentinization at The Cedars 
yielded anticlumped A“CHSD signals (-3%o) with 
OW €methane/water Values (Figs. 1A and 2). The data 
plot along the microbial (kinetic) trend defined in 
Fig. 2, supporting a previous hypothesis that meth- 
ane at The Cedars is being produced by active mi- 
crobial methanogenesis (29). The exceptionally 
high H, concentration (up to 50% by volume in 
bubbles) at The Cedars indicate the massive ex- 
cess of electron donors. This, along with severe 
inorganic carbon limitation [due to high pH (>11) 
and precipitation of carbonate minerals (29)], drives 
the formation of methane carrying strong kinetic 
imprints, consistent with the observed anticlumped 
A®CHD signals (Fig. 4). 

Despite the similarity in geologic setting, meth- 
ane associated with serpentinization at CROMO 
(30) revealed very different A®CH;D values, which 
correspond to low apparent temperatures (42° to 
76°C) and plot close to the equilibrium line (Fig. 2). 
Although the conventional 5"°C and 8D values of 
methane from CROMO are nearly identical to 
those of the Utica Shale sample (Fig. 1B), methane 
at CROMO carries much higher A“CH;D values 
(Fig. 1A). The origin of methane at the CROMO 
site remains unresolved (30), but the comparably 
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Fig. 2. Extent of 
clumped and hydro- 
gen isotopic disequili- 
bria in methane. 
Symbols and vertical 
error bars are the same 
as those in Fig. 1. 
Horizontal error bars 
represent uncertainties 
on estimates of 
Emethane/water (23) 
(table S4). The solid 
green curve represents 
isotopic equilibrium, 
with the Emethane/water 
calibration given by 
(36). Green shading 
represents ranges of 
€methane/water calibra- 
tions from published 
reports (fig. S3). Gray 
shading represents 
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high A“?CH3D values at CROMO suggest that 
methane here could be sourced from a mixture of 
thermogenic and microbial methane. Alternative- 
ly, lower Hy availability at CROMO, compared 
with The Cedars (table $4), may support microbial 
methanogenesis under near-equilibrium condi- 
tions (Fig. 4). Regardless, the different isotopo- 
logue signatures in methane from CROMO versus 
The Cedars demonstrate that distinct processes 
contribute to methane formation in these two ser- 
pentinization systems. 

Deep, ancient fracture fluids in the Kidd Creek 
mine in the Canadian Shield (37) contain copious 
quantities of both dissolved methane and hydro- 
gen (5). The Kidd Creek methane occupies a dis- 
tinct region in the diagram of A@CH3D versus 
Emethane/water (Fig. 2), due to strong D/H disequilibria 
between methane and water (4) and low-A“CH;D 
temperature signals of 56° to 90°C that are 
consistent with other temperature estimates for 
these groundwaters (4). Although the specific mecha- 
nisms by which the proposed abiotic hydrocarbons 
at Kidd Creek are generated remain under inves- 
tigation (5, 32), the distinct isotopologue signals 
provide further support for the hypothesis that 
methane here is neither microbial nor thermogenic. 

Our results demonstrate that measurements 
of °CH;D provide information beyond the sim- 
ple formation temperature of methane. The com- 
bination of methane and water hydrogen-isotope 
fractionation and “CH;D abundance enables the 
differentiation of methane that has been formed 
at extremely low rates in the subsurface (3, 21, 27) 
from methane formed in cattle and surface envi- 
ronments in which methanogenesis proceeds at 
comparatively high rates (33, 34). 
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ISOTOPE GEOCHEMISTRY 


Biological signatures in clumped 


isotopes of 0, 


Laurence Y. Yeung,’?*+ Jeanine L. Ash,’*+ Edward D. Young” 


The abundances of molecules containing more than one rare isotope have been applied 
broadly to determine formation temperatures of natural materials. These applications of 
“clumped” isotopes rely on the assumption that isotope-exchange equilibrium is reached, 
or at least approached, during the formation of those materials. In a closed-system 
terrarium experiment, we demonstrate that biological oxygen (O2) cycling drives the 
clumped-isotope composition of O2 away from isotopic equilibrium. Our model of the 
system suggests that unique biological signatures are present in clumped isotopes of 
O2—and not formation temperatures. Photosynthetic O2 is depleted in 120780 and 170180 
relative to a stochastic distribution of isotopes, unlike at equilibrium, where heavy-isotope 
pairs are enriched. Similar signatures may be widespread in nature, offering new tracers of 


biological and geochemical cycling. 


tatistical thermodynamics predicts that 
heavy isotopes will be bound together in 
a molecule more often than predicted by 
chance alone, provided the system is at iso- 
topic equilibrium (J, 2). This preference for 
heavy-isotope pairing and its variation with tem- 
perature forms the basis of clumped-isotope ther- 
mometry (3-5), a class of approaches based on 
precise measurements of molecules containing 


more than one rare isotope. When isotope-exchange 
reactions facilitate the equilibration of heavy- 
isotope pairs, the resulting isotopic distribution 


Department of Earth, Planetary, and Space Sciences, 
University of California, Los Angeles, CA 90095, USA. 
Department of Earth Science, Rice University, Houston, TX 
77005, USA. 

*Corresponding author. E-mail: lyeung@rice.edu (L.Y.Y.); jlash@ 
ucla.edu (J.L.A.) {These authors contributed equally to this work. 


24 APRIL 2015 * VOL 348 ISSUE 6233 431 


RESEARCH | REPORTS 


has indeed been shown to achieve equilibrium 
across a wide range of temperatures (4, 6-8); 
however, isotopic equilibrium is the exception 
rather than the rule in nature. Biogenic sub- 
stances, for example, are often formed through 
irreversible enzymatic reactions for which isotope- 
exchange equilibrium cannot be expected a priori. 
Yet, many natural materials with kinetically con- 
strained and/or biological origins (e.g., carbonate 
shells) show only minor departures from equi- 
librium isotope fractionation (9-11). Large biolog- 
ical and physical effects on heavy-isotope pairing 
could complicate the interpretation of emerging 
clumped-isotope thermometers in methane, O., 
and other candidate systems (4, 5, 12). 

Here, we consider photosynthetic O. forma- 
tion from water at the oxygen-evolving complex 
of Photosystem IIT (OEC). In the OEC, O-O bond 
formation occurs at the end of a five-step light- 
dependent sequence (Fig. 1). This reaction most 
likely does not equilibrate O-O isotope pairs giv- 
en the lack of isotopic equilibration between 
water and the O, produced (13-16). We argue 
that the tendency for two heavy oxygen isotopes 
to be bound together during oxygenic photosyn- 
thesis reflects primarily the isotopic preferences 
of water molecules binding to the OEC. These 
patterns of heavy-isotope pairing should be ap- 
parent in clumped isotopes of Oj. Measurements 
of the 080 (mass 36) and %0"8O (mass 35) 
isotopologues of O., together with bulk isotopic 
ratios (80/80 and ™O/"%0), characterize the 
number of heavy-isotope pairs in a sample rela- 
tive to the number expected by chance alone (i.e., 
the stochastic distribution). These deviations 
are quantified as A3, and Ag; values: Excesses of 
89'80 and 0"*0 relative to the stochastic dis- 
tribution of isotopes in the sample results in 
Az6 > O and Ags > O, respectively. A deficit in 
89'80 and 00 results in Agg < O and Ags < 0. 

The Age and Ags signatures of oxygenic photo- 
synthesis can thus be estimated by assigning each 
water-binding site its own isotopic fractionation 
factor o = “Ryouma/“Rwater Where '®R is the ratio 
of 80 to '°O atoms in each reservoir. At natural 
isotopic abundances, the bulk isotopic composi- 
tion of photosynthetic O. is the weighted sum of 
those contributions—ie., °R, = 12; [C’Rwater * 
ta) + C8Rwater ¥ Op)], with binding sites A and B 
each contributing one of two oxygen atoms in 
each O» molecule. The probability of generating 
80-80 bonds is therefore °°R,, = (SRwater X Oa) 
(8R ater X Op). The stochastic distribution of 0 
atoms is calculated from the bulk 80/70 ratio as 
*Retocnastic = (°Rp)”. The expression for Ase,» 
then reduces to (17) 


z | (1) 


Equation 1 reveals that, in all cases, Azg, < 0; 
contrary to the enhanced isotope pairing that 
would be expected at isotopic equilibrium, there 
is an apparent aversion to heavy-isotope pairing 
associated with photosynthetic O, production. If 
the isotopic preferences at each water-binding 
site are equal (o, = og), then Age, = 0. If the 
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binding sites are not equivalent (a, * Gg), as 
isotope-labeling studies indicate (78, 19), then 0 = 
A®°? > —0,9 per mil (%o) for plausible o-values 
between 0.97 and 1.03 (20, 27). A similar expres- 
sion can be derived for A3;,, values, which are 
predicted to be about half those of Age, (see the 
supplementary text). These values cannot be in- 
terpreted as formation temperatures because all 
equilibrated samples have A,, = 0 (2). Photosyn- 
thesis should therefore impart a distinct non- 
equilibrium clumped-isotope signature on Oy. 

We conducted a closed-system terrarium ex- 
periment with six water hyacinths (Eichhorniae 
crassipes) to explore the effects of biological oxy- 
gen cycling on five isotopologues of O, (17). The 
terrarium was illuminated with fluorescent lights 
on a 12-hour/12-hour light-dark cycle. Headspace 
samples were purified and analyzed over a 1-year 
period for both the bulk and clumped isotopic 
composition of O.. We found that biological oxy- 
gen cycling altered isotopic ordering in the head- 
space Os, yielding apparent steady-state Az, and 
Ags Values that are inconsistent with O, forma- 
tion temperatures and more consistent with the 
predicted photosynthetic endmembers (Fig. 2 
and table S3). The Agg and Ag; values of O2 were 
driven down from atmospheric values [2%o and 
1%o, respectively (4)] and down past equilibrium 
values at 25°C (1.5%o and 0.8%o, respectively), 
finally approaching an apparent isotopic steady 
state at the stochastic distribution of isotopes 
(Agg = -0.01 + 0.08%o, and Ags = 0.0 + 0.1%; 
1 SEM, n = 4). The plant community shifted to 
an algae-dominated ecosystem during the first 
6 months, altering the isotopic, chemical, and 
physical properties of the terrarium (fig. S1). 
However, the clumped-isotope composition of 
the headspace O, evolved steadily toward its ap- 
parent steady state, similar to the evolution of 
the oxygen triple-isotope composition. Steady- 
state A’”O values were 165 parts per million 
(ppm), consistent with those reported in similar 
experiments (22, 23). 

Dark incubations of the terrarium, which con- 
sumed up to 35% of the headspace Ox, caused Az6 
values to increase linearly with time up to ~1%o 
(Fig. 2). The Az; values, in contrast, remained 
generally constant (means of Az, = 0.1 + 0.1%o0 
and 0.1 + 0.05%o; 1 SD). Returning to light-dark 
cycles restored the clumped-isotope composition 
to its apparent steady-state value after 6 months 
(Agg = -0.09 + 0.06%o, and Ags = 0.0 + 0.1%0; 
1 SEM, 7 = 3). To test the veracity of these mea- 
surements, headspace O, samples drawn from 
both light and dark incubations were photolyt- 
ically equilibrated at known temperatures (4). 
The equilibrations yielded A3g and A3; values of 
O, consistent with isotope-exchange equilibrium 
(table S3), suggesting that our observations are 
unlikely to be analytical artifacts. Atmospheric 
O, leaking into the terrarium would increase 8"°O 
far too rapidly relative to Agg to explain these 
observations. The observed clumped-isotope varia- 
tions therefore most likely arise from biological 
and physical processes inside the terrarium. 

We constructed a two-reservoir model of O4 
(i.e., in headspace and water) in the terrarium 


Fig. 1. Conceptual diagram of O2 formation at 
the OEC. The five-step Kok cycle for the water- 
splitting reaction 2H20 + 4hv > Os + 4H* + 4e7 is 
shown without electron flow (32). Transitions be- 
tween intermediate oxidation states of the OEC (Sg 
to S4) occur upon absorption of visible light. The 
water-binding sequence is based on experimental 
results (19, 33, 34), which also indicate that water 
substrates are exchangeable at least up to state S3 
on chemically distinct binding sites (18, 19). The 
O-O bond is formed during the S,-to-So transition, 
expressing the isotopic fractionations a, and ap 
from water substrate binding. 


that accounts for photosynthetic O. formation, 
fractionation of O, due to respiration, and air- 
water gas exchange (17). We included kinetic iso- 
tope fractionation for gas transfer into and out of 
solution [*“oczxinetic = 0.9972 for “O/'°O (24)]. 
The model was run with a range of plausible 
isotope fractionation factors for respiration [“op = 
0.97 - 0.99 (25, 26)] and gas-exchange rates (24, 27) 
to examine the sensitivity of the terrarium head- 
space to changes in those quantities. The oxygen 
triple-isotope composition of the terrarium water 
was measured and used as the bulk isotopic com- 
position of photosynthetic O» (13, 15, 17). No sin- 
gle set of parameters explained all of the isotopic 
variations during the entire experiment, likely 
due to the evolving biological community, so we 
focus on isotopic variations at steady state and 
during dark incubations. 

The increase of headspace Azg and Az; values 
in the dark implies that the apparent steady- 
state values near zero can only be reached if 
light-dependent processes drive A3g and Ag, val- 
ues below zero. Equation 1 suggests that photo- 
synthesis could be the relevant mechanism, because 
the O, generated is likely to have Age, and Ag5,p 
values less than zero. To estimate the compo- 
sition of this source, we note that kinetic and 
equilibrium isotope effects for relevant photo- 
synthetic fractionations are probably in the range 
0.96 > 8a > 1.04 (20, 21), which we broaden to a 
more conservative plausible range of 0.9 > a > 
1.1. This range of isotope effects gives lower limits 
on Age and Ags, of -10%o and -5%o, respectively. 

If the Ags increase during dark incubations 
were solely caused by fractionation in respira- 
tion, then large isotope effects in water-enzyme 
binding would be required: Azg,, < -10%bo is 
needed to achieve steady-state values of Azg near 
zero (17). In addition, the associated Ag35,, < -5%o 
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Fig. 2. Evolution of con- 
centration and O2 isoto- 
pologue composition in 
the terrarium. Observa- 
tions (data points) are 
compared with model 
results (curves). Uncer- 
tainties are not shown for 
clarity, but long-term 
analytical uncertainties in 
O> concentration, 880, 
AO, Age, and Ags are 1%, 
0.04%, 5 ppm, 0.17%o, 
and 0.3%o0, respectively. 
A single isotopologue 
discrimination factor 
(“ep = -17%o) is used 
here to illustrate steady- 
state behavior in 8780 
and A”’0; a more detailed 
model run yields better 
agreement for 880 and 
A”’O but similar results 
for Age and Ags. Mass- 
dependent exponents 
used in the model, B3ayn, 
are labeled, with sub- 
scripts R and GE denoting 
values for respiration and 
gas exchange, respec- 
tively. For B34/35,ce and 
B34/36,ce, two model runs 
are shown to illustrate 
their effects on the Age 
and A3s5 time traces (17). 


Aye (%o) 


5° (%o vs. Air O,) 


endmember composition causes poor agreement 
between measured and modeled Ag; values (fig. 
S4C). Furthermore, an increase in respiration 
rates would drive Azg and Az; values higher, whereas 
a decrease in respiration rates would drive the O, 
toward its Azg,, and A35,, photosynthetic values 
(17). Therefore, when the O, cycle was out of bal- 
ance in the first 6 months, Azz would have fluc- 
tuated inversely with O, concentration (fig. S4, 
B and C). Instead, both Azg and A3; decreased 
nearly monotonically. 

Isotopologue fractionation during nonequilib- 
rium O, gas exchange could explain the in- 
creases of headspace Agg and A3;5 values during 
dark incubations. The fractionation in headspace 
69'80/'O, is closer to that for gas exchange 
than that for respiration (“ogpservea = 0.995 ver- 
sus **ogz;inetic = 0.9972 versus **az ~ 0.98), sug- 
gesting that the Az, and Ag, increases are similarly 
dominated by gas exchange. Modeling the mass 
dependence of gas exchange using the dark incu- 
bation data yields A3g, and A35,, values within a 
plausible range (i.e., Ag, = -0.4%o, Ags, = -0.2%o) 
(Fig. 2). The evolution of Agg and Ag; is also more 
robust to imbalances in the O, cycle (17). Other 
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oxygen-consumption mechanisms, such as sulfide 
oxidation, could impart additional isotopologue 
signatures (28), so attributing isotopologue dis- 
crimination in the dark to a single process is 
necessarily a simplification. Indeed, the implied 
mass dependence of O2 consumption in the dark 
terrarium is unusual, and it merits further in- 
vestigation (17). A detailed understanding of iso- 
topologue fractionation factors will require more 
controlled experiments of isolated biological and 
physical processes. Yet, the specific isotopologue 
discrimination during dark incubations does not 
affect the conclusion that photosynthesis gen- 
erates O. with an “anticlumped” isotopologue 
distribution (i.e., Agg < O and Ags < 0). This bio- 
logical signature in Oy may be readily observed 
in the surface ocean, where it could be used to 
constrain gross primary productivity by exploit- 
ing the contrast between biological and atmo- 
spheric O, clumped-isotope signatures (29). Isotopic 
ordering in atmospheric O, is relatively unaf- 
fected by biological O, cycling because photo- 
chemical equilibration of O, exceeds rates of 
biological cycling by at least a factor of 100 (4, 30). 
Using a biological endmember composition of 


Dark 
incubation 


12h/12h 
Light/dark cycles 


Baaiss.oe = 1.8 


Bsars3,p = 9.517 
Bssis3,ce = 9-510 


50 100 
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Age = 0, we calculate that biological effects on the 
tropospheric A3, budget are therefore most likely 
on the order of 0.01%o. 

Our observations indicate that variations in 
the isotopologue abundance of even simple mol- 
ecules like O, capture the chemistry of complex 
natural systems. Broader application of these tech- 
niques could yield insights into the mechanisms of 
biomolecule synthesis, e.g., methanogenesis, nitro- 
gen reduction during denitrification, and molec- 
ular hydrogen release during nitrogen fixation 
(31). Moreover, because clumped-isotope signa- 
tures can depend only on isotope fractionation 
factors and not on the isotopic composition of 
substrates, a new class of reservoir-insensitive ap- 
proaches for tracing biogeochemical cycling could 
emerge from these molecular-scale insights. 
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Big names or big ideas: Do peer-review 
panels select the best science proposals? 


Danielle Li'’*+ and Leila Agha”**+ 


This paper examines the success of peer-review panels in predicting the future quality of 
proposed research. We construct new data to track publication, citation, and patenting 
outcomes associated with more than 130,000 research project (RO1) grants funded by 

the U.S. National Institutes of Health from 1980 to 2008. We find that better peer-review scores 
are consistently associated with better research outcomes and that this relationship persists 
even when we include detailed controls for an investigator’s publication history, grant history, 
institutional affiliations, career stage, and degree types. A one-standard deviation worse 
peer-review score among awarded grants is associated with 15% fewer citations, 7% fewer 
publications, 19% fewer high-impact publications, and 14% fewer follow-on patents. 


n 2014, the combined budgets of the U.S. Na- 
tional Institutes of Health (NIH), the U.S. Na- 
tional Science Foundation, and the European 
Research Council totaled almost $40 billion. 
The majority of these funds were allocated to 
external researchers whose applications were vetted 
by committees of expert reviewers. But as funding 
has become more competitive and application 
award probabilities have fallen, some observers 
have posited that “the system now favors those 
who can guarantee results rather than those with 
potentially path-breaking ideas that, by definition, 
cannot promise success” (7). Despite its importance 
for guiding research investments, there have been 
few attempts to assess the efficacy of peer review. 
Peer-review committees are unique in their 
ability to assess research proposals based on deep 
expertise but may be undermined by biases, insuf- 
ficient effort, dysfunctional committee dynamics, 
or limited subject knowledge (2, 3). Disagreement 
about what constitutes important research may 
introduce randomness into the process (4). Exist- 
ing research in this area has focused on under- 
standing whether there is a correlation between 
good peer-review scores and successful research 
outcomes and yields mixed results (5-7). Yet raw 
correlations do not reveal whether reviewers are 
generating insight about the scientific merit of 
proposals. For example, if applicants from elite 
institutions generally produce more highly cited 
research, then a system that rewarded institutional 
rankings without even reading applications may 
appear effective at identifying promising research. 
In this paper, we investigate whether peer re- 
view generates new insights about the scientific 
quality of grant applications. We call this ability 
peer review’s “value-added.” The value-added of 
NIH peer review is conceptually distinct from the 
value of NIH funding itself. For example, even if 
reviewers did a poor job of identifying the best 
applications, receiving a grant may still improve 
a researcher's productivity by allowing her to main- 
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tain a laboratory and support students. Whereas 
previous work has studied the impact of receiv- 
ing NIH funds on the productivity of awardees 
(8, 9), our paper asks whether NIH selects the 
most promising projects to support. Because NIH 
cannot possibly fund every application it receives, 
the ability to distinguish potential among appli- 
cations is important for its success. 

We say that peer review has high value-added 
if differences in grants’ scores are predictive of 
differences in their subsequent research output, 
after controlling for previous accomplishments 
of the applicants. This may be the case if review- 
ers generate additional insights about an appli- 
cation’s potential, but peer review may also have 
zero or even negative value-added if reviewers are 
biased, mistaken, or focused on different goals (0). 

Because research outcomes are often skewed, 
with many low-quality or incremental contribu- 
tions and relatively few ground-breaking discov- 
eries (2, 11), we assess the value-added of peer 
review for identifying research that is highly in- 
fluential or shows commercial promise. We also 
test the effectiveness of peer review in screening 
out applications that result in unsuccessful re- 
search (see the supplementary materials for full 
details on data and methods). 

NIH is the world’s largest funder of biomedical 
research (2). With an annual budget of approxi- 
mately $30 billion, it supports more than 300,000 
research personnel at more than 2500 institutions 
(12, 13). A funding application is assigned by topic 
to one of approximately 200 peer-review com- 
mittees (known as study sections). 

Our main explanatory variable is the “percentile 
score,” ranging from 0 to 100, which reflects an ap- 
Dlication’s ranking among all other applications 
reviewed by a study section in a given fiscal year; 
lower scores correspond to higher-quality applica- 
tions. In general, applications are funded in order 
of their percentile score until the budget of their 
assigned NIH institute is exhausted. The average 
score in our sample is 14.2, with a standard deviation 
(SD) of 10.2; only about 1% of funded grants in 
our sample had a score worse than 50. Funding 
has become more competitive in recent years; 
only 14% of applications were funded in 2013. 
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Our sample consists of 137,215 research project 
(RO1) grants funded from 1980 through 2008. ROIs 
are project-based renewable grants that are NIH’s 
primary grant mechanism, accounting for about 
half of its extramural grant spending. Of the grants 
in our sample, 56% are for new projects; the re- 
maining successfully competed for renewal. We 
focus on funded grants because funding is likely 
to have direct effect on research productivity, mak- 
ing it difficult to infer the success of peer review by 
comparing funded and unfunded grants. Because 
our sample grants have the same funding status, 
we can attribute any remaining relationship between 
scores and outcomes to peer review, rather than 
funding. Because grants are almost always funded 
in order of their score, there is relatively little scope 
for selection on unobservables to introduce bias. 


Our primary outcome variables are (i) the total 
number of publications that acknowledge grant 
support within 5 years of grant approval (via 
PubMed); (ii) the total number of citations that 
those publications receive through 2013 (via Web 
of Science); and (iii) patents that either directly 
cite NIH grant support or cite publications ac- 
knowledging grant support [via the U.S. Patent 
and Trademark Office (USPTO)]. These publica- 
tion, citation, and patent outcomes are designed 
to reflect NIH’s stated goals of rewarding research 
with high scientific and technical merit. 

We also measure applicant-level characteristics: 
an investigator's publication and grant history, edu- 
cational background, and institutional affiliation. 
We match investigators with publications using 
their full last name and their first and middle initials 


(14). We track the number of articles an applicant 
published in the 5 years before submitting her ap- 
plication, as well as the impact of those publica- 
tions as measured by the citations they have received 
by the time the application is evaluated. We iden- 
tify “high-impact” publications as being among the 
top 0.1%, 1%, and 5% most cited, compared with 
articles published in the same year. To more pre- 
cisely assess the quality of an applicant’s ideas, we 
repeat this exercise for articles in which the appli- 
cant is a first or last author only. Our regression 
results include separate controls for each type of 
publication: any authorship position, and first or 
last author publications. By counting only citations 
received up to the date of grant review, we ensure 
that our measures contain only information available 
to reviewers at the time they evaluate the application. 


Table 1. Do peer-review scores predict future citations and publications? 
Each reported figure is the coefficient on scores from a single Poisson 
regression of grant outcomes on NIH peer-review scores; standard errors 
are reported in parentheses. The actual sample size used per regression 
depends on the number of nonzero observations for the dependent variable. 
The independent variable is the percentile score. “Future citations” refers to 
the total number of citations, to 2013, that accrue to all publications that 
acknowledge funding from a given grant. “Future publications” refers to the 
total number of such publications. Subject-year controls refer to study section 


by fiscal year fixed effects, as well as NIH institute fixed effects. Pl publication 
history includes controls for number of past publications, number of past 
citations, and number of past hit publications. Pl career characteristics include 
controls for degrees and experience (time since highest degree). PI grant 
history controls for number of previous ROls and non-RO1 NIH funding. Pl 
institution and demographics control for the rank of the PI's institution, as well 
as gender and some ethnicity controls. Standard errors are clustered at the 
study section year level. *, statistical significance at the 10% level; **, 5% level; 
*** 1% level. 


Model 1 Model 2 Model 3 Model 4 Model 5 Model 6 
Dependent variable: Future citations 
nGeP ene Vanenle: ~0.0203***  -0,0215***  — -0.0162*** — -0,0164*** —-0.0162*** —_ -0.0158*** 
NIH percentile score 
(0.0006) (0.0008) (0.0007) (0.0007) (0.0007) (0.0007) 
N 137,215 136,076 136,076 128,547 128,547 128,547 
Dependent variable: Future publications 
ei venice variable: -0.0155***  -0.0091*** —_-0.0076*** — -0.0077*** — -0.0076*** —— -0.0075*** 
percentile score 
(0.0003) (0.0003) (0.0003) (0.0003) (0.0003) (0.0003) 
N 137,215 136,111 136,111 128,580 128,580 128,580 
Controls 
Subject-year X X Xx Xx Xx 
PI publication history X X Xx X 
PI career characteristics X X Xx 
PI grant history X X 
PI institution/demographics X 
2. § 
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ro oe 
Se ° 
z 3 
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Fig. 1. Scatterplot of percentile scores and grant outcomes. The left panel plots the relationship between percentile scores and citations associated with a 
grant. Each dot represents a single grant. The right panel does the same for total publications. Extreme outliers with more than 10,000 citations or 200 


publications are not displayed here. 
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the relatively well-developed projects that receive 
NIH RO! funding (5). The median grant in our 
sample received 116 citations to publications ac- 
knowledging the grant; the mean is more than 
twice as high, 291, with an SD of 574. This varia- 
tion in citations underscores the potential gains 
from being able to accurately screen grant appli- 
cations on the basis of their research potential. 

Our first set of results describes peer review’s 
value-added for identifying research likely to re- 
sult in many publications or citations. Table 1 re- 
ports results from Poisson regressions of future 
outcomes on peer-review scores, with different 
controls for an applicant’s previous performance. 
The supplementary materials describe many addi- 
tional robustness checks. 

Model 1 of Table 1 reports, without any control 
variables, the percentage change in the number 
of citations and publications associated with a 
grant, given a one point increase in its percentile 
score. We find that NIH evaluations are statisti- 


We observe whether an applicant has an M.D., 
Ph.D., or both, as well as the year in which she 
received her final doctoral degree. We are missing 
degree and experience information for 0.45% and 
7.16% of our sample, respectively; we include two 
separate indicators for missing these data. We 
measure whether this applicant previously received 
an RO1 grant and whether the applicant has re- 
ceived any previous NIH funding. Using the name 
of the principal investigator (PI), we employ a 
probabilistic algorithm developed by Kerr to deter- 
mine applicant gender and ethnicity (Hispanic or 
Asian) (15, 16, 17). We rank applicants’ institu- 
tions by the number of NIH grants received over 
our study period and measure whether each appli- 
cant is from a top 5-, 10-, 20-, or 50-ranked insti- 
tution. We are unable to determine the institutional 
affiliation of 14% of investigators; we include an 
indicator variable for missing institution information. 

Consistent with previous work, there is sub- 
stantial dispersion in research output even among 
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Fig. 2. Smoothed scatterplots of percentile scores and residual grant outcomes. These figures 
display smoothed scatterplots of the nonparametric relationship between unexplained variation in grant 
outcomes and percentile score, after accounting for differences in field of research, year, and applicant 
qualifications. The left panel plots the relationship between percentile scores and residual citations 
associated with a grant. The right panel does the same for residual publications. 


cally related to grant quality; our estimated coef- 
ficients indicate that a one percentile point worse 
peer-review score is associated with 1.6% fewer 
publications and 2% fewer citations. To consider 
the magnitude of these findings more clearly, we 
will describe our results by reporting how predicted 
outcomes change with a 1-SD (10.17 point) worse 
percentile score; in Model 1, a 1-SD worse score is 
associated with a 14.6% decrease in grant-supported 
research publications and a 18.6% decrease in 
citations to those publications (P < 0.001). This 
calculation is based on the overall SD in percentile 
score among funded grants, unconditional on PI 
characteristics (78). Figure 1 illustrates the raw re- 
lationship between scores and citations and publi- 
cations in a scatterplot; the plot suggests a negative 
sloping relationship (recall that higher percentile 
scores indicate less favorably reviewed research). 

There are potential concerns with interpreting 
the unadjusted relationship between scores and 
outcomes as a measure of peer review’s value. 
Some grants may be expected to produce more 
citations or publications and thus appear higher 
quality, independent of their true quality. Older 
grants have more time to produce publications 
that in turn have more time to accrue citations. A 
publication with 100 citations may be average in 
one field but exceptional in another. 

Model 2 of Table 1 addresses these concerns by 
including detailed fixed effects for study sections 
by year cells and NIH institutes. The inclusion of 
these fixed effects means that our estimates are 
based only on comparisons of scores and out- 
comes for grants evaluated in both the same 
fiscal year (to account for cohort effects) and in the 
same study section (to account for field effects). 
We also include NIH institute-level fixed effects 
to control for differences in citation and publica- 
tion rates by fields, as defined by a grant’s area of 
medical application. Controlling for cohort and 
field effects does not attenuate our main finding. 
For a 1-SD (10.17 point) worse score, we expect an 
8.8% decrease in publications and a 19.6% decrease 
in citations (both P < 0.001). This suggests that 
scores for grants evaluated by the same study 


Table 2. Do peer-review scores predict hit publications and follow-on patents? Each reported figure is the coefficient on scores from a single Poisson 
regression of grant outcomes on NIH peer-review scores; standard errors are in parentheses. High-impact publication is given by the count of publications 
acknowledging the grant that receive more citations than all but 0.1%, 1%, or 5% of publications from the same year. Direct patents are those that acknowledge 
funding from a grant; indirect patents are those that cite publications that acknowledge funding from a grant. We control for the same variables as described in 


Model 6 of Table 1. 


Dependent variable: High-impact publications 


Top 0.1% Top 1% Top 5% 
(1) (2) (3) 
IESpencent Variable -0.0246*** -~0.0209*** -0.0172*** 
NIH percentile score 
(0.0025) (0.0014) (0.0009) 
N 88,795 118,245 125,021 
Controls 
Subject-year Xx Xx Xx 
PI publication history X X X 
PI career characteristics X X X 
PI grant history X X X 
PI institution/demographics X X X 
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Dependent variable: Patents 


Direct Indirect 
(4) (5) 
-0.0153*** -0.0149*** 
(0.0015) (0.0022) 
122,850 92,893 
X Xx 
Xx Xx 
xX Xx 
X X 
Xx X 
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section in the same year and assigned to the same 
NIH institute are better than randomly allocated. 
We may observe this pattern, however, if review- 
ers simply give good scores to applicants with strong 
research credentials, and applicants with strong 
credentials generally tend to produce better research. 
Model 3 of Table 1 adds controls describing a PI’s 
publication history in order to ask whether study 
section scores contain information about the qual- 
ity of an application that could not be predicted 
by simply examining a PI’s curriculum vita. 
Specifically, we include the following additional 
control variables: (i) the number of articles pub- 
lished in the past 5 years; (ii) the total number of 
citations those articles have received up to the 
year of grant review; (iii) three variables describ- 
ing the number of top 0.1%, 1%, and 5% articles 
that the PI has published in the previous 5 years; 
and (iv) alternate versions of these variables con- 
structed only with the subset of publications for 
which the applicant was a first or last author. 
Controlling for publication history attenuates 
but does not eliminate the relationship: a 1-SD 
(10.17 point) worse score is associated with a 7.4% 
decrease in future publications and a 15.2% de- 
crease in future citations (both P < 0.001). 


The association between better scores and better 
outcomes could also be explained by the Matthew 
effect, a sociological phenomenon wherein credit 
and citations accrue to established investigators 
simply because they are established, regardless of 
the true quality of their work (19, 20). Were this 
the case, more connected applicants may receive 
better scores and more citations regardless of the 
true quality of their work. Our approach may thus 
credit peer review for responding to prestige, rather 
than the underlying quality of an applicant’s ideas. 

Model 4 controls for the PI’s experience by 
adding indicators for whether the applicant has 
an M.D., Ph.D., or both, as well as a series of 
indicator variables capturing how many years 
have elapsed since receiving her terminal degree. 
If reviewers were simply giving better scores to 
candidates with more experience or skill writing 
grant proposals and publishing papers, then we 
would expect scores to become less predictive of 
future research output once we control for M.D./ 
Ph.D. status and time since degree. Instead, our 
estimated relationship between peer-review scores 
and outcomes remains unchanged. 

Model 5 considers the possibility that peer re- 
viewers may be rewarding an applicant’s grant 
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proposal writing skills rather than the underly- 
ing quality of her work. Specifically, we include 
variables controlling for whether the PI received 
NIH funding in the past, including four indica- 
tors for having previously received one RO1 grant, 
two or more ROI grants, one NIH grant other 
than an ROI, and two or more other NIH grants. 
To the extent that reviewers may be responding 
to an applicant’s experience and skill with pro- 
posal writing, we would expect the inclusion of 
these variables reflecting previous NIH funding 
to attenuate our estimates of value-added. We find, 
however, that including these variables does not 
substantively affect our findings. 

Finally, in Model 6, we also control for insti- 
tutional quality, gender, and ethnicity, to capture 
other potentially unobserved aspects of prestige, 
connectedness, or access to resources that may 
influence review scores and subsequent research 
productivity. Our estimates again remain stable: 
comparing applicants with statistically identical 
backgrounds, the grant with a 1-SD worse score 
is predicted to have 7.3% fewer future publications 
and 14.8% fewer future citations (both P < 0.001). 

Across Models 3 to 6, the estimated relation- 
ship between peer-review scores and outcomes 
remains remarkably stable, even as we add more 
covariates that describe an applicant’s past accom- 
plishments, prestige, proposal-writing skill, and 
professional connections. Although these variables 
certainly cannot capture every potential source 
of omitted variables bias, the stability of our results 
suggests that political connections and prestige are 
not a primary driver of peer review’s value-added. 

Next, we explore whether reviewers’ expertise 
enables them to identify the strongest applica- 
tions or to more efficiently screen out weaker 
applications. We use a local linear regression 
model to nonparametrically identify the relation- 
ship between peer-review score and research qual- 
ity. This flexibility will allow the predictive power 
of peer-review scores to differ at each point along 
the score spectrum. We implement this approach 
in two steps, which are described in detail in the 
supplementary materials. First, we construct the 
residuals from a linear regression of research out- 
comes on all of the explanatory variables in Mod- 
el 6, excluding the study section percentile score 
itself. These residuals represent the portions of 
grants’ citations or publications that cannot be 
explained by applicants’ previous qualifications 
or by application year or subject area (as detailed 
above). We then produce a locally weighted, linearly 
smoothed scatterplot relating peer-review scores 
to these residual citations and publications. 

Figure 2 shows that peer reviewers add value 
by identifying the strongest research proposals. 
For all percentile scores less than 50 (the vast ma- 
jority of awarded grants), worse scores are asso- 
ciated with lower expected residual citations and 
publications. The relationship is particularly 
steep at very low percentile scores, suggesting 
that study sections are particularly effective at 
discriminating quality among very well-reviewed 
applications. 

One notable exception occurs for very poorly 
scored applications—those with percentile scores 
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over 50—that were nonetheless funded. In this 
range, worse review scores are associated with 
higher citation counts. These applications con- 
stitute about 1% of funded applications and are 
highly unlikely to have met the standard award 
threshold but were instead funded “out of order.” 
We find higher average quality for this set of 
selected grants, suggesting that when program 
officers make rare exceptions to peer-review de- 
cisions, they are identifying a small fraction of 
applications that end up performing better than 
their initial scores would suggest. 

Our final analysis explores whether peer re- 
viewers’ value-added comes from being able to 
identify transformative science, science with con- 
siderable applied potential, or from being able to 
screen out very low-quality research. We define a 
“hit” publication as among the top 0.1%, 1%, or 
5% most cited publications in its cohort, using all 
citations a publication receives through 2013. To 
explore whether reviewers have value-added in 
terms of identifying research with practical appli- 
cations, we track the number of patents that 
explicitly acknowledge NIH funding. The major- 
ity of NIH grants, however, do not directly result 
in patents. Thus, we also count the number of pat- 
ents that cite research funded by a grant (indirect 
patenting).We construct this variable by linking 
grants to publications using grant acknowledg- 
ment data and then applying a fuzzy matching 
algorithm that identifies publications cited by 
USPTO patents (27). This allows us to identify 
patents that cite publications that in turn acknowl- 
edge a grant. Importantly, this process (described 
further in the supplementary materials), allows 
us to identify patents regardless of whether those 
patents are assigned to the same investigator 
funded by the NIH grant. Indeed, most often these 
patents are held by private firms (22). 

As reported in Table 2, peer-review scores have 
value-added identifying hit publications and re- 
search with commercial potential. A 1-SD (10.17 
points) worse score is associated with a 22.1%, 
19.1%, and 16.0% reduction in the number of 
top 0.1%, 1%, and 5% publications, respectively. 
These estimates are larger in magnitude than our 
estimates of value-added for overall citations, 
especially as we consider the very best publica- 
tions. The large value-added for predicting tail 
outcomes suggests that peer reviewers are more 
likely to reward projects with the potential for a 
very high-impact publication and have considerable 
ability to discriminate among strong applications. 

A 1-SD worse percentile score predicts a 14% 
decrease in both direct and indirect patenting. 
Because of the heterogeneous and potentially long 
lags between grants and patents, many grants in 
our sample may one day prove to be commercially 
relevant even if they currently have no linked 
patents. This time-series truncation makes it more 
difficult to identify value-added with respect to 
commercialization of research and means that 
our estimates are likely downward biased. 

Finally, we investigate the nonparametric rela- 
tionship between percentile scores and publica- 
tion outcomes, testing which score ranges are 
associated with the highest numbers of “hit” 
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publications, ranking at the top of the citation 
distribution, and which score ranges are associ- 
ated with the highest numbers of “miss” publica- 
tions, ranking near the bottom of the distribution. 
We follow the same local linear regression smooth- 
ing procedure outlined above and described in 
more detail in the supplementary materials. 

Figure 3 shows that low percentile scores are 
consistently associated with higher residual num- 
bers of hit publications, variation unexplained 
by the applicant’s background or field of study. 
The relationship between scores and residual re- 
search outcomes is steepest among the most well- 
reviewed applications. For example, funded grants 
with percentile scores near 0 are predicted to 
produce 0.05 more publications in the top 0.1% 
of the citation distribution, compared with appli- 
cations scored near the 10th percentile (holding 
constant applicant qualifications and field). 

Although this may seem like a modest increase, 
there is a small number of such hit publications, 
so a 0.05 increase in their number corresponds 
to a doubling of the mean number of top 0.1% 
publications arising from a grant. This relation- 
ship between scores and hit publications becomes 
weaker among applications with less competitive 
scores; a 10-percentile point difference in scores 
in the range of 20 to 30 would predict only a 
0.0004 difference in the number of top 0.1% pub- 
lications. This finding runs counter to the hypo- 
thesis that, in light of shrinking budgets and 
lower application success rates, peer reviewers 
fail to reward those risky projects that are most 
likely to be highly influential in their field (/, 2). 

We don’t find evidence that the peer-review 
system adds value beyond previous publications 
and qualifications in terms of screening out low- 
citation papers. Better percentile scores are asso- 
ciated with slightly more publications in the 
bottom 50% of the citation distribution. There 
is no discernible relationship between residual 
publications in the bottom 20% and peer-review 
score among the funded grants in our sample, sug- 
gesting that while these less influential anticipated 
publications are not rewarded by the peer-review 
system, they are also not specifically penalized. 

Our findings demonstrate that peer review 
generates information about the quality of appli- 
cations that may not be available otherwise. This 
does not mean that the current NIH review sys- 
tem would necessarily outperform other alloca- 
tion mechanisms that do not rely on expert peer 
evaluations. Our analysis focuses on the relation- 
ship between scores and outcomes among funded 
grants; for that reason, we cannot directly assess 
whether the NIH systematically rejects high- 
potential applications. Our results, however, sug- 
gest that this is unlikely to be the case, because 
we observe a positive relationship between better 
scores and higher-impact research among the set 
of funded applications. 

Although our findings show that NIH grants 
are not awarded purely for previous work or elite 
affiliations and that reviewers contribute valuable 
insights about the quality of applications, mis- 
takes and biases may still detract from the qual- 
ity of funding decisions. We have not included an 


accounting of the costs of peer review, most no- 
tably the time investment of the reviewers. These 
bibliometric outcomes may not perfectly capture 
NIH objectives or be the only measures relevant 
for evaluating social welfare; ideally, we would like 
to link grants with health and survival outcomes, 
but constructing those measures is difficult and 
beyond the scope of this paper. Future research 
may focus on whether the composition of peer- 
review committees is important to determining 
their success, including evaluator seniority and 
the breadth and depth of committee expertise. 
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VACCINES 


An Ebola whole-virus vaccine is 
protective in nonhuman primates 
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Zaire ebolavirus is the causative agent of the current outbreak of hemorrhagic fever 
disease in West Africa. Previously, we showed that a whole Ebola virus (EBOV) vaccine 
based on a replication-defective EBOV (EBOVAVP30) protects immunized mice and guinea 
pigs against lethal challenge with rodent-adapted EBOV. Here, we demonstrate that 
EBOVAVP30 protects nonhuman primates against lethal infection with EBOV. Although 
EBOVAVP30 is replication-incompetent, we additionally inactivated the vaccine with 
hydrogen peroxide; the chemically inactivated vaccine remained antigenic and protective 
in nonhuman primates. EBOVAVP30 thus represents a safe, efficacious, whole-EBOV 
vaccine candidate that differs from other EBOV vaccine platforms in that it presents all 
viral proteins and the viral RNA to the host immune system, which might contribute to 


protective immune responses. 


he Ebola virus (EBOV) outbreak in West 

Africa has already claimed more than 5000 

lives (J) and remains uncontrolled. One coun- 

termeasure to mitigate Ebola virus infec- 

tions is vaccination. Several Ebola virus 
vaccine platforms have been developed over the 
last decades (2), three of which recently advanced 
to clinical trials: a DNA-based vaccine expressing 
different Ebola virus glycoproteins (GPs, the ma- 
jor Ebola virus immunogen) (3, 4), a replication- 
incompetent chimpanzee adenovirus expressing 
GP (5), and a live-attenuated vesicular stomatitis 
virus (VSV) expressing GP (5). The DNA platform 
completely protects nonhuman primates (the “gold 
standard” for Ebola virus research) only after mul- 
tiple dosages of the DNA vaccine in combination 
with recombinant adenovirus (6), but has not 
been tested as a standalone vaccination strategy. 
The recombinant adenovirus platform (including 
the recently developed recombinant chimpanzee 
adenovirus) requires high vaccine doses and boost- 
ing to achieve complete and durable protection 
of nonhuman primates against lethal challenge 
with EBOV (7, 8). Complete protection of non- 
human primates against lethal EBOV challenge 
has also been accomplished with the VSV plat- 
form; however, the use of a replicating recombi- 
nant VSV (9-12) may be of concern because of 
issues related to vaccine safety. Hence, although 
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several platforms are being tested in clinical trials, 
additional options should be explored. 
Whole-virus vaccines (either live attenuated or 
inactivated) have a long history as successful hu- 
man vaccines, offering protection against poten- 
tially deadly viral diseases such as smallpox, 
influenza, mumps, and measles (13). Whole-virus 
vaccines present multiple viral proteins and the 
viral genetic material to the host immune sys- 
tem, which may trigger a broader and more ro- 
bust immune response than vectored vaccines 
that present only single viral proteins. However, 
initial attempts to develop a gammaz-irradiated, 
inactivated whole-EBOV vaccine failed to provide 
robust protection of nonhuman primates against 
challenge with a lethal dose of EBOV (J4). 
Previously, we developed a replication-defective 
EBOV (termed EBOVAVP30) which is based on 
the Mayinga strain of EBOV and lacks the coding 
region for the essential viral transcription acti- 
vator, VP30 (15). EBOVAVP30 replicates to high 
titers in cell lines that stably express the VP30 
protein, is genetically stable, and is nonpatho- 
genic in rodents (15, 16). Mice and guinea pigs 
immunized twice with EBOVAVP30 were fully 


protected against a lethal challenge with mouse- 
or guinea pig-adapted EBOV, respectively (16). 
EBOVAVP30 is a biosafety level-3 agent and ex- 
empt from “Select Agent” status; an EBOVAVP30 
vaccine could therefore be manufactured in ex- 
isting biosafety level-3 facilities that operate under 
good manufacturing practices. 

To assess the effectiveness of EBOVAVP30 
whole-virus vaccine in nonhuman primates, we in- 
oculated groups of cynomolgus macaques (Table 1) 
intramuscularly (i.m.) with Dulbecco’s modified 
essential medium (DMEM) (control, group 1), a 
single dose of 10” focus-forming units (FFU) of 
EBOVAVP30 (group 2), or two doses of 10’ FFU 
of EBOVAVP30 4 weeks apart (group 3). Previ- 
ously, we demonstrated the genomic stability of 
EBOVAVP30 by carrying out three independent 
experiments that each comprised seven consec- 
utive passages of the virus in VeroVP30 cells. 
After the last passages, we sequenced the region 
surrounding the VP30 deletion site and did not 
detect any recombination events or mutations. 
Moreover, the passaged viruses did not grow in 
wild-type cells, further indicating the lack of re- 
combination. Despite these findings, concerns 
have been raised that such an event could poten- 
tially affect vaccine safety. Recently, virus inac- 
tivation with hydrogen peroxide was shown to 
preserve the antigenicity of lymphocytic chorio- 
meningitis (7, 18), vaccinia (17), West Nile (77, 19), 
and influenza (20) viruses. To increase the bio- 
safety profile of EROVAVP30, we therefore treated 
it with hydrogen peroxide (H2O., 3% final con- 
centration) for 4 hours on ice, followed by viral 
plaque assays in VP30-expressing cells, which con- 
firmed complete virus inactivation. Nonhuman 
primates were then vaccinated twice with 10’ 
FFU of the H,O,-treated EBOVAVP30 (group 4; 
two animals). Gamma-irradiation is an established 
procedure for Ebola virus inactivation, but irra- 
diation conditions optimized for virus inactiva- 
tion (rather than for antigenic epitope preservation) 
may alter antigenicity and therefore protective 
efficacy of Ebola virus vaccines (14). To test these 
concepts, we also vaccinated macaques twice with 
10’ FFU of wild-type EBOV gamma-irradiated in 
BSL: containment (group 5); again, the irradiation 
conditions used here ensured virus inactivation, but 


Table 1. Overview of vaccination and challenge strategy. 


Group Vaccine _ Inactivation _ Vaceanation Protection Euthanasia 
Prime Boost 
O%t 
x _ -_ _ t 
Group 1 Mock (n= A) Days 7 to 8 
0, 
Group 2. EBOVAVP30 - 1x10’FFU = pa “ N/AS 
10) 
Group 3 EBOVAVP30 _ 1x10’ FFU 1x 10’ FFU es i N/A 
0 
Group 4 EBovavp30 YrgeN 4 ag? FU 1x 107 FFU 100% N/A 
peroxide (n= 2) 
Gamma- 7 7 0% 
Group 5 EBOV feradiation 1x10’ FFU 1x 10’ FFU (n= 4) Days 6 to 9 
*DMEM. +Percentage of animals that survived challenge with a lethal dose of EBOV. {Days after 
challenge. 8N/A, not applicable. 
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were not optimized to preserve antigenicity. None 
of the vaccinated animals showed signs of ill- 
ness, confirming our earlier data from mice and 
guinea pigs that EBOVAVP30 is nonpathogenic 
in animals (6). 

Four weeks after the last immunization, we 
challenged animals in BSL-4 containment i.m. 
with a lethal dose (1000 FFU) of the heterologous 
Kikwit strain of EBOV. While control macaques 
in group 1 had to be euthanized on day 7 or 8 
after challenge according to established and ap- 
proved humane endpoint criteria (27) (Table 1), 
all animals immunized once (group 2) or twice 
(group 3) with the EBOVAVP30 vaccine survived 
the lethal challenge (Table 1). In addition, both 
animals immunized twice with H2O.-treated 
EBOVAVP30 vaccine (group 4) survived infection 
with wild-type EBOV, indicating that H,O,-treated 
EBOVAVP30 is immunogenic and elicits protec- 
tive immune responses. In contrast, all macaques 
immunized with gamma-irradiated wild-type EBOV 
(group 5) developed signs of severe EBOV disease 
and had to be euthanized between days 6 and 9 
after challenge (Table 1), supporting the concept 
that gamma-irradiation optimized for virus inac- 
tivation alters the immunogenicity of EBOV vac- 
cines. The macaques that had to be euthanized 
after challenge with EBOV (groups 1 and 5) had 
high virus titers in their blood after challenge 
(Fig. 1). In contrast, no viremia was detected in 
animals immunized twice with untreated (group 
3) or H,O,-treated EBOVAVP30 (group 4) (Fig. 
1), showing that H,O.-treated EBOVAVP30 elic- 
ited a protective immune response. One of four 
animals that received a single immunization with 
EBOVAVP30 [nonhuman primate (NHP) 8 in 
group 2] was viremic on days 3 and 6 after chal- 
lenge, but cleared the virus on day 9 (Fig. 1). In 
addition, a different animal in group 2 (NHP 7) 
had a fever on day 6 after challenge (table S1). 
These data indicate that a single vaccination with 
EBOVAVP30 does not always prevent EBOV rep- 
lication or signs of illness (fever), but does pro- 
tect the host from death upon EBOV challenge. 
Together, our findings demonstrate the vaccine 
potential of a whole-EBOV vaccine based on 
EBOVAVP30. 

To better understand the correlates of protec- 
tion, we measured the immune responses 2 and 
4 weeks after the last immunization (i.e., 2 weeks 
and immediately before EBOV challenge). Two 
weeks after the last vaccination (day -14), ma- 
caques immunized twice with EBOVAVP30 (group 
3) had a high immunoglobulin G (IgG) antibody 
response to the viral GP based on a GP-specific 
enzyme-linked immunosorbent assay (ELISA) 
assay (Fig. 2). Two immunizations with H2O>- 
treated EBOVAVP30 (group 4) resulted in a slight- 
ly lower, but still robust, immune response (Fig. 2). 
In macaques immunized once with EBOVAVP30 
(group 2), we detected a low, but measurable, IgG 
antibody response (Fig. 2). Serum samples from 
animals that succumbed to EBOV challenge— 
namely, those mock-immunized (group 1) or 
immunized twice with gamma-irradiated wild- 
type EBOV (group 5)—did not possess measur- 
able IgG titers to GP (Fig. 2). The IgG titers to 
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EBOV GP on the day of challenge (day 0, Fig. 2) 
followed the same trend, but were low. The IgG 
titers to EBOV GP closely mirrored survival rates 
and virus titers (Table 1 and Fig. 1); these data 
indicate that immunization with EBOVAVP30 
elicits an antibody response to GP that is impor- 
tant for protection against EBOV infection. A 
similar correlation between a GP-specific anti- 
body response and protection has been demon- 
strated with other experimental EBOV vaccine 
platforms (22, 23). 

The antibody repertoire was further character- 
ized by assessing the levels of neutralizing anti- 
bodies to GP as measured by plaque reduction 
neutralization (PRNT) assays. The serum dilution 
that reduced the titer of VSV-expressing EBOV 
GP by =50% (plaque reduction neutralization 
titer 50, PRNT;9) was 1:20 to 1:40 for samples 
obtained from animals immunized twice with 
EBOVAVP30 (group 3; table S2); no statistically 
significant decline in neutralizing antibody levels 
was detected between day -14 (2 weeks before 
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107 107 
z 10° 10% 
105 1 
— 0 0 
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10! 10 
10° 10° 
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challenge) and day 0 (table S2). In contrast, we 
detected slightly lower PRNTso titers of ~1:10 for 
sera obtained from animals immunized once with 
untreated or H.O.-treated EBOVAVP30 (groups 
2 or 4, respectively; table S2). No neutralizing 
antibodies were detected in control animals or 
animals immunized twice with gamma-irradiated 
EBOV (groups 1 or 5, respectively; table S2). Over- 
all, the neutralizing antibody titers were low, but 
similar to those detected upon vaccination of 
animals with VSV-expressing EBOV GP (11). 
Most experimental Ebola virus vaccine plat- 
forms provide only the viral GP as antigen, ex- 
pressed from recombinant viruses or protein 
expression plasmids; in contrast, the EROVAVP30 
vaccine presents all viral proteins plus the viral 
genetic material to the host. Early studies with 
EBOV-like particles (VLPs) suggested that the 
viral matrix protein (VP40) and nucleoprotein 
(NP) are also immunogenic (24), prompting us 
to carry out ELISAs specific for these two viral 
proteins. Two weeks after the last vaccination 


10°41 Group 3 


345678 9 01234667 8 9 
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Days post-challenge 


Fig. 1. Virus titers in the 
blood of infected nonhuman 
primates. Animals were 
immunized as shown in Table 1. 
Four weeks after the last immu- 
nization, animals were infected 
with a lethal dose of EBOV. 
Shown are EBOV titers in the 


blood of individual nonhuman primates from each group. Virus titers are shown as 50% tissue culture 


infective dose (TCIDs0). 
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Fig. 2. Immune responses in vaccinated nonhuman primates. |gG antibody responses to EBOV GP 
2 weeks after the last vaccination (day -14) and on the day of challenge (day 0). Antibody titers were 
measured with an ELISA specific for EBOV GP. Titers shown are the highest reciprocal dilution that 


resulted in an optical density (OD) of =0.2. 
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(day -14), macaques immunized twice with un- 
treated (group 3) or H2O.-treated (group 4) 
EBOVAVP30 had high NP and VP40 antibody 
titers (fig. S1). Lower, but still robust, NP and 
VP40 antibody titers were observed in macaques 
immunized once with EBOVAVP30 (group 1). Con- 
trary to the GP antibody titers, we also detected 
NP and VP40 antibodies in animals immunized 
twice with gamma-irradiated EBOV (group 5), 
suggesting that gamma-irradiation under condi- 
tions optimized for virus inactivation has a greater 
effect on the antigenicity of GP epitopes than on 
that of NP and VP40 epitopes. Collectively, these 
data demonstrate that antibodies to NP and VP40 
are elicited after vaccination with EBOVAVP30 
and that the levels of these antibodies are higher 
in protected animals than in those that succumbed 
to infection. However, the importance of NP and 
VP40 antibodies to protection from EBOV infec- 
tion is not yet known. 

In addition to the antibody response, we also 
measured the cellular immune response by ex- 
amining the number of mononuclear cells pro- 
ducing interferon-y (IFN-y). On day -14 (2 weeks 
before challenge), animals in groups 2 and 3, 
immunized one or twice with EBOVAVP30, re- 
spectively, had the highest number of IFN-y- 
producing cells (fig. S2). Although treatment of 
EBOVAVP30 with H2O2 (group 4) reduced the 
number of IFN-y-producing cells, more IFN-y- 
producing cells were detected in these animals 
compared with those immunized twice with gamma- 
irradiated EBOV (group 5; fig. S2) or left un- 
treated (group 1; fig. $2). 

Data from Geisbert et al. (J4) and our present 
findings suggest that gamma-irradiation opti- 
mized to inactivate EBOV destroys the antigenicity 
of wild-type EBOV, particularly in EBOV GP. H,O.- 
treated EBOVAVP30, however, elicited a robust 
IgG response and protected nonhuman primates 
against lethal EBOV challenge, although H.O2 
treatment resulted in a slight reduction of anti- 
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genicity compared with untreated virus (Fig. 2). 
Hence, H2O2 treatment of EROVAVP30 appears 
to preserve key antigenic epitopes, as has been 
demonstrated for other viruses (17-20). To exam- 
ine potential differences in antigenicity between 
gamma-irradiated and H.O.-treated virus, we per- 
formed an ELISA-based assay, using a panel of 
19 monoclonal antibodies (mAbs) directed against 
GP. Most mAbs showed levels of binding compa- 
rable to that of GP; however, four (mAbs 12, 21, 
226, and 662) reacted more efficiently with H,O.- 
treated than with gamma-irradiated virus (Fig. 3). 
Most likely, gamma-irradiation affected the con- 
formation of the epitopes recognized by these 
antibodies, resulting in the lack of protection 
upon immunization with gamma-irradiated vi- 
rus. Hence, the epitopes recognized by mAbs 12, 
21, 226, and 662 may play an important role in 
antibody-mediated protection in immunized ma- 
caques and potentially in humans; indeed, mAb 
226 is known to have virus-neutralizing proper- 
ties (25). One mAb (1031) interacted more efficiently 
with gamma-irradiated than with H,O,-treated 
virus, while a polyclonal antiserum reacted simi- 
larly with both virus preparations tested (Fig. 3). 

When EBOV was first discovered over 35 years 
ago, whole-virus vaccines inactivated by formalin 
or gamma-irradiation were tested, but failed to 
elicit complete protection in nonhuman primates 
(14). The development of whole-virus vaccines 
was therefore abandoned, and VLPs composed 
of GP and VP40 (and NP) were explored as a safe 
and immunogenic platform to present several viral 
proteins to the host immune system (2, 26-28). 
These VLPs are immunogenic, but three vacci- 
nations with adjuvanted VLPs were required to 
achieve protective efficacy in nonhuman primates 
(24). Here, we present a vaccine strategy that 
offers several advantages: (i) It provides protec- 
tion from a lethal challenge of EBOV in nonhuman 
primates after a single immunization, although 
one animal became viremic and another animal 
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Fig. 3. Effects of H202-treatment and gammza-irradiation on the antigenicity of EBOV GP. Using a 
panel of 19 mAbs (1 ug/ml) directed against EBOV GP, we performed an ELISA to examine the antigenicity 
of gamma-irradiated EBOV (blue) and H202-treated EBOVAVP30 (red). 
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developed a fever; (ii) it is highly immunogenic, 
as shown by robust antibody responses elicited 
upon vaccination; (iii) it is amenable to large- 
scale production, because EBOVAVP30 grows to 
titers of >10’ FFU/ml in VP30-expressing cells (15); 
(iv) it is safe, owing to its inability to replicate 
outside VP30-expressing cells (15); and (v) it 
presents all viral proteins and its genomic RNA 
to the host, similar to whole-virus vaccines and 
VLPs. It should be noted that NHPs immunized 
once with EBOVAVP30 (group 2) were protected 
from a lethal EBOV challenge, although two of 
the four animals showed signs of illness (fever 
was detected in NHP 7, and viremia was detected 
in NHP 8; table S1). However, all four animals in 
group 2 (NHPs 5 to 8) showed similar immune 
responses (table S2 and summarized in table S3). 

To address any potential concerns over re- 
combination events that would restore the repli- 
cative ability of EBOVAVP30, we also chemically 
inactivated it with H,O,. Hydrogen peroxide treat- 
ment causes breaks in single- and double-stranded 
DNA or RNA (17) and thus inactivates viruses 
without affecting their antigenicity. By contrast, 
gamma-irradiation (used to generate the first ex- 
perimental whole EBOV vaccine) causes the (de) 
hydroxylation of amino acids, the cleavage of 
polypeptide backbones (29), and the generation 
of free radicals that could cause the destruction 
of the antigenic properties of some epitopes. These 
differences in mechanism may explain why viruses 
treated with H,O, are more immunogenic than 
those irradiated with gamma rays; however, op- 
timization of irradiation conditions may improve 
the immunogenicity of vaccine candidates. 

In summary, our data indicate that EROVAVP30 
is an effective whole-EBOV vaccine that warrants 
further assessment. 
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GENOME EDITING 


The mutagenic chain reaction: A 
method for converting heterozygous 
to homozygous mutations 


Valentino M. Gantz* and Ethan Bier* 


An organism with a single recessive loss-of-function allele will typically have a wild-type 
phenotype, whereas individuals homozygous for two copies of the allele will display a 
mutant phenotype. We have developed a method called the mutagenic chain reaction 
(MCR), which is based on the CRISPR/Cas9 genome-editing system for generating 
autocatalytic mutations, to produce homozygous loss-of-function mutations. In Drosophila, 
we found that MCR mutations efficiently spread from their chromosome of origin to the 
homologous chromosome, thereby converting heterozygous mutations to homozygosity 
in the vast majority of somatic and germline cells. MCR technology should have broad 


applications in diverse organisms. 


tis often desirable to generate recessive loss- 
of-function mutations in emergent model 
organisms; however, identifying such muta- 
tions in the heterozygous condition is chal- 
lenging. Taking advantage of the CRISPR/ 


Fig. 1. Scheme outlining the mutagenic chain 
reaction (MCR). (A to C) A plasmid consisting of a 
core cassette carrying a Cas9 transgene, a gRNA 
targeting a genomic sequence of interest, and 
flanking homology arms corresponding to genomic 
sequences abutting the target cleavage site (A) 
inserts the core Cas9-gRNA cassette into the 
targeted locus via HDR [(B) and (C)]. (D to F) In 
turn, the inserted cassette expresses both Cas9 
and the gRNA, leading to cleavage (D) and HDR- 
mediated insertion of the cassette into the second 
allele, thereby rendering the mutation homozygous 
[(E) and (F)]. HA1l and HA2 denote the two 
homology arms that directly flank the gRNA- 
directed cut site. 
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Cas9 genome-editing method (J, 2), we have 
developed a strategy to convert a Drosophila 
heterozygous recessive mutation into a homozy- 
gous condition manifesting a mutant phenotype. 
We reasoned that autocatalytic insertional mutants 
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could be generated with a construct having three 
components: (i) A Cas9 gene (expressed in both 
somatic and germline cells), (ii) a guide RNA 
(gRNA) targeted to a genomic sequence of in- 
terest, and (iii) homology arms flanking the 
Cas9-gRNA cassettes that match the two ge- 
nomic sequences immediately adjacent to either 
side of the target cut site (Fig. 1A). In such a 
tripartite construct, Cas9 should cleave the ge- 
nomic target at the site determined by the gRNA 
(Fig. 1A) and then insert the Cas9-gRNA cassette 
into that locus via homology-directed repair (HDR) 
(Fig. 1, B and C). Cas9 and the gRNA produced 
from the insertion allele should then cleave the 
opposing allele (Fig. 1D), followed by HDR- 
driven propagation of the Cas9-gRNA cassette 
to the companion chromosome (Fig. 1, E and F). 
We refer to this trans-acting mutagenesis scheme 
as a mutagenic chain reaction (MCR). 

We expected that autocatalytic allelic conver- 
sion by MCR should be very efficient in both 
somatic and germline precursor cells, given the 
high frequency and specificity of mutagenesis (3) 


Section of Cell and Developmental Biology, University of 
California, San Diego, La Jolla, CA 92095, USA. 
*Corresponding author. E-mail: vgantz@ucsd.edu (V.M.G.); 
ebier@ucsd.edu (E.B.) 


D 
oy 


second allele 


& — 
+ + 
Cas9 gRNA 


E Homology Directed Repair (HDR) 


HA1 Cas9 gRNA HA2 
F 
Cas9 gRNA 
Cas9 gRNA 


sciencemag.org SCIENCE 


RESEARCH | REPORTS 


and efficacy of homology-based integration (4) 
mediated by separate genome-encoded Cas9 and 
gRNA genes observed in previous studies. We 
tested this prediction in D. melanogaster with 
the use of a characterized efficient target se- 
quence (yZ) (5) in the X-linked yellow (y) locus 
as the gRNA target and a vasa-Cas9 transgene 
as a source of Cas9 (Fig. 2C) because it is ex- 
pressed in both germline and somatic cells (4). 
As the defining element of our MCR scheme, 
we also included two homology arms, ~1 kb each, 
flanking the central elements (Fig. 2C) that pre- 
cisely abut the gRNA-directed cut site. Wild-type 
(y+) embryos were injected with the y-MCR 
element (see supplementary materials), and emerg- 
ing Fo flies were crossed to a y+ stock. According 
to Mendelian inheritance, all F, female prog- 
eny of such a cross should have a y+ phenotype 
(i.e., F, females inherit a y+ allele from their 
wild-type parent). 

From two independent Fy male (3) x y+ fe- 
male (2) crosses and 7 Fo x y+ crosses, we 
recovered y- F,2 progeny, which should not 
happen according to Mendelian inheritance of a 
recessive allele. Six such y@“™ F,Q were crossed 
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individually to y+, resulting in 95 to 100% 
(average = 97%) of their F, progeny exhibiting a 
full-bodied y- phenotype (Fig. 2, E and G, and 
table S1), in contrast to the expected rate of 50% 
(i.e., only in males). We similarly tested MCR 
transmission via the germline in two y- F,¢ 
recovered from an Fo cross that also yielded 
y- female siblings. These y- F, were considered 
candidates for carrying the y-MCR construct and 
were crossed to y+ females. All but one of their F2 
female progeny had a full-bodied y- phenotype 
(Fig. 2, E and F). Occasionally among y“ F,9 
we also recovered mosaics (~4%) with a few small 
y+ patches as well as a lone example of a 50% 
chimeric female (Fig. 2H), and in two instances, 
we recovered y+ male progeny from a y“* F,9 
mother (Fig. 2E and table S1). These infrequent 
examples of imperfect y-MCR transmission indi- 
cate that although HDR is highly efficient at this 
locus in both somatic and germline lineages, the 
target occasionally evades conversion. 
Polymerase chain reaction (PCR) analysis of 
the y locus in individual y- F, progeny confirmed 
the precise gRNA- and HDR-directed genomic 
insertion of the y-MCR construct in all flies giving 
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Fig. 2. Experimental demonstration of MCR in Drosophila. (A) Mendelian male inheritance of an 
X-linked trait. (B) Theoretical MCR-based inheritance results in the initially heterozygous allele con- 
verting the second allele, thereby generating homozygous female progeny. (C) Diagram of y-MCR construct. 
Two y locus homology arms flanking the vasa-Cas9 and y-gRNA transgenes are indicated, as are the 
ocations of the PCR primers used for analysis of the genomic insertion site (see supplementary materials). 
(D) PCR analysis of a y+ MCR-derived F2d (lanes 1 to 3; see fig. S1 for sequence), y““R F,2 (lanes 4 to 6), and 
yMCR F. (lanes 7 to 9) showing junctional bands corresponding to y-MCR insertion into the chromosomal y 
locus (lanes 2, 3, 5, 6, 8, and 9) and the presence (lanes 1 and 4) or absence (lane 7) of a PCR band derived 
from the y locus. Although the yM® Fy (carrying a single X chromosome) displays only MCR-derived PCR 
products (lanes 8 and 9), y“°® F,9s generate both MCR and noninsertional amplification products. (E) 
Summary of Fs progeny obtained from crosses described in table Sl. (F) Low-magnification view of F> 
progeny flies from an y“CR gx y+Q cross. Nearly all female progeny display a y— phenotype. (G) High- 
magnification view of a full-bodied y“°R F,9. (H) A rare 50% left-right mosaic female. (I) A y+ control fly. 
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rise to y- female F, progeny (Fig. 2D). Males car- 
ried only this single allele, as expected, whereas 
females in addition possessed a band correspond- 
ing to the size of the wild-type y locus (Fig. 2D, 
lane 4), which varied in intensity between indi- 
viduals, indicating that females were mosaic for 
MCR conversion. The left and right y-MCR PCR 
junction fragments were sequenced from y- F, 
progeny from five independent Fp parents. All 
had the precise expected HDR-driven insertion 
of the y-MCR element into the chromosomal y 
locus. In addition, sequence analysis of a rare 
nonconverted y+ allele recovered in a male off- 
spring from ay F,° (Fig. 2E) revealed a single- 
nucleotide change at the gRNA cut site (resulting 
in a TI substitution), which most likely resulted 
from nonhomologous end-joining repair, as well 
as an in-frame insertion-deletion (indel) in a y+2 
sibling of this male (fig. S1 and table S1). The 
high recovery rate of full-bodied y- F, and F, 
female progeny from single parents containing 
a yMCR allele detectable by PCR indicates that 
the conversion process is remarkably efficient in 
both somatic and germline lineages. Phenotypic 
evidence of mosaicism in a small percentage of 
MCR-carrying females and the presence of y 
locus-derived PCR products of wild-type size in 
all tested y- F, females suggest that females 
may all be mosaic to varying degrees. In summary, 
both genetic and molecular data reveal that the 
y-MCR element efficiently drives allelic conver- 
sion in somatic and germline lineages. 

MCR technology should be applicable to dif- 
ferent model systems and a broad array of sit- 
uations, such as enabling mutant F, screens in 
pioneer organisms, accelerating genetic manipu- 
lations and genome engineering, providing a po- 
tent gene drive system for delivery of transgenes 
in disease vector or pest populations, and po- 
tentially serving as a disease-specific delivery sys- 
tem for gene therapy strategies. We provide an 
example in this study of an MCR element caus- 
ing a viable insertional mutation within the cod- 
ing region of a gene. It should also be possible, 
however, to efficiently generate viable deletions 
of coding or noncoding DNA by including two 
gRNAs in the MCR construct targeting separated 
sequences and appropriate flanking homology 
arms. Using the simple core elements tested in 
this study, MCR is applicable to generating homo- 
zygous viable mutations, creating regulatory mu- 
tations of essential genes, or targeting other 
nonessential sequences. The method may also be 
adaptable to targeting essential genes if an in- 
frame recoded gRNA-resistant copy of the gene 
providing sufficient activity to support survival 
is included. 

In addition to these positive applications of 
MCR technology, we are also keenly aware of the 
substantial risks associated with this highly inva- 
sive method. Failure to take stringent precautions 
could lead to the unintentional release of MCR 
organisms into the environment. The supple- 
mentary material includes a stringent, institu- 
tionally approved barrier containment protocol 
that we developed and are currently adhering 
to for MCR experiments. Since this study was 
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submitted for publication, a preprint has been 
posted on the bioRxiv web server showing that a 
split Cas9-gRNA gene drive system efficiently 
biases inheritance in yeast (6). The split system 
was used to avoid accidental escape of the gene 
drives. The use of a similar strategy in future 
MCR organisms would reduce, but not eliminate, 
risks associated with accidental release. We 
therefore concur with others (7, 8) that a dia- 
logue on this topic should become an immediate 
high-priority issue. Perhaps, by analogy to the fa- 
mous Asilomar meeting of 1975 that assessed 
the risks of recombinant DNA technology, a sim- 
ilar conference could be convened to consider 
biosafety measures and institutional policies ap- 
propriate for limiting the risk of engaging in 
MCR research while affording workable oppor- 
tunities for positive applications of this concept. 
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PROTEIN FOLDING 


Translational tuning optimizes 
nascent protein folding in cells 


Soo Jung Kim,’ Jae Seok Yoon,’ Hideki Shishido,’ Zhongying Yang,’ LeeAnn A. Rooney,’ 


Jose M. Barral,”? William R. Skach’** 


In cells, biosynthetic machinery coordinates protein synthesis and folding to optimize 
efficiency and minimize off-pathway outcomes. However, it has been difficult to delineate 
experimentally the mechanisms responsible. Using fluorescence resonance energy 
transfer, we studied cotranslational folding of the first nucleotide-binding domain from the 
cystic fibrosis transmembrane conductance regulator. During synthesis, folding occurred 
discretely via sequential compaction of N-terminal, a-helical, and a/B-core subdomains. 
Moreover, the timing of these events was critical; premature a-subdomain folding 
prevented subsequent core formation. This process was facilitated by modulating intrinsic 
folding propensity in three distinct ways: delaying o-subdomain compaction, facilitating 
B-strand intercalation, and optimizing translation kinetics via codon usage. Thus, de novo 
folding is translationally tuned by an integrated cellular response that shapes the 
cotranslational folding landscape at critical stages of synthesis. 


ost proteins must acquire a defined 
three-dimensional structure in order to 
function. Folding pathways that gener- 
ate these structures have primarily been 
characterized by using model substrates 
that fold rapidly, spontaneously, and reversibly 
in vitro (J, 2). In cells, however, protein folding 
is kinetically coupled to synthesis as the nascent 
polypeptide emerges from the ribosome. Whereas 
certain small proteins may remain unstructured 
during synthesis (3), many complex proteins ex- 
hibit length-dependent folding intermediates whose 
structural properties (4) and/or folding efficien- 
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cies (5) deviate from those observed in vitro. In 
such cases, the folding energy landscape, as well 
as folding outcome, can be influenced by ribo- 
some effects (4, 6, 7), polypeptide elongation rate 
(8-10), molecular crowding (JI, 12), and cotrans- 
lational interactions with cellular chaperones (13, 14). 
Indeed, cotranslational constraints can bias kinet- 
ically competing folding events to generate alter- 
nate stable structures with different functional 
properties (8, 15, 16). Despite improved compu- 
tational methods, few principles have been es- 
tablished experimentally to explain how biosynthetic 
parameters influence specific folding events and 
outcome (3, 4, 17-19). 

To address this issue, we used fluorescence 
resonance energy transfer (FRET) to examine 
structural transitions of ribosome-bound folding 
intermediates generated through in vitro trans- 
lation of truncated RNA transcripts. This approach 
derives from the principle that during folding, 
certain residues distant in primary structure 
are brought into close proximity, increasing the 


FRET efficiency between donor and acceptor flu- 
orophores that are cotranslationally incorporated 
into the nascent polypeptide (Fig. 1A) (78, 19). 
Here, the donor fluorophore, cyan fluorescent pro- 
tein (CFP), was fused to the N terminus of the first 
nucleotide-binding domain (NBD1) from the 
cystic fibrosis transmembrane conductance reg- 
ulator (CFTR), and a small acceptor dye was in- 
corporated at surface-exposed residues (UAG 
codons) by using a synthetic suppressor tRNA 
(figs. S1 and S2). FRET measurements obtained 
at sequential nascent chain lengths thus provide 
conformational snapshots into the equilibrium 
ensemble of stably arrested ribosome-bound nas- 
cent chains in the context of their native biosyn- 
thetic machinery (Fig. 1A) (3, 17-20). 

Using this system, we defined the cotransla- 
tional folding pathway of CFTR NBDI1, whose de- 
fective folding causes cystic fibrosis (21-24). NBD1 
contains three subdomains (N-terminal, a-helical, 
and parallel-F1-type-B-sheet core) and exhibits a 
complex vectoral topography that limits CFTR 
maturation (22, 25) and prevents reversible fold- 
ing in vitro. To examine its cotranslational fold- 
ing pathway, FRET acceptor sites were chosen 
within 4 to 9 A of the CFP fusion (Thr®°’) (26) 
to report on the positioning of strands S3, S6, 
87, and S8 in the B-sheet core (Fig. 1, B and C). 
Analysis of sequentially stalled polypeptides 
yielded a characteristic length-dependent rise 
and plateau in FRET for each acceptor site (Fig. 
1D). This rise in FRET reports on acquisition of a 
native-like fold (19) and reflects the earliest bio- 
synthetic stage at which the acceptor dye and its 
corresponding f-strand are optimally positioned 
within NBD1. Results show that S3, S6, S7, and 
S8 could therefore reach a native-like structure 
when the ribosome has synthesized residues 550, 
624, 654, and 674, respectively (Fig. 1D), although 
actual folding intermediates will depend on rela- 
tive folding kinetics and translation elongation rate. 

Despite their proximity, S6 exhibited a more 
gradual rise in FRET and was optimally positioned 
at a later stage of synthesis than was S3 (Fig. 1D). 
Because the ribosome exit tunnel sequesters ap- 
proximately 40 residues, optimal S6 positioning 
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Fig. 1. FRET-detected cotranslational compaction of NBD1 subdomains. (A) 
Schematic of ribosome-bound nascent chain at sequential stages of synthesis, 
showing change in distance (d) between donor and acceptor probes as polypeptide 
transitions from unfolded (middle) to folded (right) conformation. (B) Location of 
acceptor probes (Arg*°°, Arg*®”, Asp®®”, and Asn®°?”) in NBD1 primary structure. 
Helices and B-strands are drawn as cylinders and filled arrows, respectively. (©) Acceptor probe sites in NBD1 crystal structure [Protein Data Bank (PDB) 2BBO]. 
(D) FRET efficiency (Errer) plotted as a function of nascent chain length for indicated acceptor probe (n = 3 + SEM or average of n = 2 replicates). Dotted line 
indicates synthetic stage needed to achieve maximal FRET. Shown on right is the minimal polypeptide outside the ribosome required to optimally position S3 
(Arg*°°), S6 (Arg*®”), S7 (Asp®°”), and S8 (Asn°?”). 
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Fig. 2. Ribosome delays o-subdomain folding and facilitates p-sheet core formation. (A, C, and E) Illustration showing hypothetical folding outcomes of 
ribosome-bound and released polypeptides with equivalent cytosolically exposed residues. (B, D, and F) Errer obtained for ribosome released (black solid line) 
and ribosome-bound (red dotted line from Fig. 1D) polypeptides plotted against number of cytosolic residues (#) C-terminal to the acceptor site (n = 3 + SEM or 
average of n = 2 replicates) located at residues 450, 487, 567, in (B), (D), and (F), respectively. (G) Illustration depicting (i) role of ribosome in delaying S6 and 
a-subdomain folding to facilitate S7 and S8 insertion and (ii) resultant NBD1 misfolding due to premature S6/a-subdomain folding in the absence of ribosome. 
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Fig. 3. CFTR folding dependence on NBD1 translation elongation rate. 
(A) Predicted translation elongation rate (10) calculated as a 15—amino acid moving 
window average, for WT NBD1 (black), Slow,s25.593) (red), and Fast(s25.593) (blue) 
aligned with NBD1 secondary structural elements as they emerge from the ribo- 
some. (B) Autoradiogram of [°°S]Met-labeled WT, Slows25-593), and Fast625.593) 
NBDI1. Graph shows fold change in protein compared with that of WT at 5 min 
(n = 3 replicates + SEM). (C) CFTR immunoblot from human embryonic kidney 


was therefore delayed until the entire o-subdomain 
and $7 emerged into the cytosol (Fig. 1D). To un- 
derstand this delay, we compared the confor- 
mation of S3 and S6 in ribosome-bound versus 
ribosome-released polypeptides, where each con- 
struct contained equivalent cytosolically exposed 
residues (Fig. 2) (19, 27). Under these conditions, 
ribosome attachment had no detectable impact 
on N-terminal subdomain folding (Fig. 2, A and 
B, and fig. S3A). In contrast, S6 exhibited a high- 
er FRET efficiency and achieved a native-like con- 
formation at shorter chain lengths (by nearly 50 
residues) in the ribosome-free state (Fig. 2, C and 
D, and fig. S3B). Ribosome attachment there- 
fore delayed S6 positioning during o-subdomain 
synthesis. 
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In adenosine 5‘ -triphosphate-binding cassette 
transporters, S7 and S8 are synthesized after the 
o-subdomain and must intercalate between S3 
and S6 to form the four-strand hydrophobic 
B-sheet core (Fig. 1, B and C). Yet, S6, S7, and S8 
are all positioned into the native NBD fold at a 
similar stage of synthesis (Fig. 1D). Because op- 
timal S7 positioning requires S8 and is a prerequi- 
site for CFTR folding (fig. S4), we tested whether 
ribosome-mediated delay of S6 positioning might 
facilitate B-sheet core formation. Indeed, ribosome 
release before core folding prevented S7 insertion, 
as evidenced by a reduction in FRET efficiency 
(Fig. 2, E and F, and fig. $3, C and D). The ribo- 
some therefore exhibits two opposite but inter- 
related effects on nascent chain conformation 


(HEK) 293T cell lysate showing core glycosylated (band B) and mature (band C) 
CFTR from total, radioimmunoprecipitation assay (RIPA)-soluble, and insoluble 
lysate fractions. Graph shows fold increase over that of WT (n = 5 replicates + SEM). 
(D) [°°S]Met-labeled CFTR immunoprecipitated from RIPA soluble and insoluble 
fractions. Graphs show band C (left) and band B (right) CFTR as percentage of total 
protein recovered at time (T) = O (n = 3 replicates + SEM). (E) Schematic showing 
proposed effect of Fast;sa5.sea) Substitutions on CFTR processing (blue arrows). 


that directly influence folding outcome: delaying 
spontaneous folding at one stage of synthesis (S6 
positioning), while enabling folding at another 
(S7/S8 intercalation). Because premature polypep- 
tide release uncouples these events, the ribosome 
appears to facilitate NBD1 folding, potentially in 
concert with associated chaperones, by maintain- 
ing a relatively open o-subdomain conformation 
to ensure orderly and sequential insertion of S6, 
S87, and S8 during a critical translational window 
(Fig. 2G). 

These results predict that B-sheet core forma- 
tion depends on the timing and/or kinetics of 
o-subdomain folding. Because the predicted trans- 
lation elongation rate (10) slows dramatically 
during o-subdomain synthesis (Fig. 3A and table 
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Fig. 4. Codon usage is integrated with sequence specific folding con- 
straints. (A) NBD1 structure (PDB 1XMI) showing location of Asp®? (left) and 
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predicted interactions with Gln®°* and Arg®° in helix H5 (right). (B) Erret profile for 


D529F and WT (from Fig. 1D) NBD1. (C) Ribosome release assay of D529F CFP-NBD1- 


50 


D567UAG truncated at residue 614. (D) Immunoblot of CFTR constructs (left) and fold 
increase over WT (right) (n = 4 replicates + SEM) as in Fig. 3C. (E) Illustration showing 
that NBD1 cotranslational folding is both delayed (upward arrows) and enhanced 
(downward arrows) by cellular biosynthetic machinery to optimize folding outcome. 


S1), we introduced synonymous codon substitu- 
tions predicted to either maximize or minimize 
the translation rate within this region (Fig. 3A 
and fig. S5). Metabolic pulse labeling verified that 
Fast(595-593) Synonymous codons did indeed in- 
crease the rate of NBD1 synthesis in cells (Fig. 3B 
and fig. S6). Remarkably, despite generating 
identical amino acid sequences (fig. S7 and table 
$2), Fast(595-593) codon substitutions resulted in 
aggregation of full-length CFTR (Band B) (Fig. 
3C) as well as isolated NBD1 (fig. S8) 1 to 4 hours 
after synthesis, suggesting either delayed con- 
version into the insoluble fraction or less effi- 
cient degradation (Fig. 3, D and E). In contrast, 
Slow(595-593) codon substitutions had no detect- 
able effect on either NBD1 or full-length wild- 
type (WT) or AF508 CFTR (Fig. 3, B to D, and figs. 
S8 and S9). Similarly, Fast substitutions within 
codons 501 to 540 had no effect on CFTR ag- 
gregation (fig. S10). 

Thus, the translation elongation rate appears 
to be tailored to folding needs of specific peptide 
regions and, in this case, kinetically couples 
o-subdomain and f-sheet core folding as H3 
and H4 emerge from the ribosome (Fig. 3A). This 
region is home to numerous suppressor muta- 
tions that improve NBD1 stability, CFTR folding 
efficiency, and AF508 CFTR processing (24, 28). 
One such variant, D529F, improves NBD1 and 
full-length CFTR folding without affecting NBD1 
thermal stability (25), suggesting that it may act 
along the folding pathway. (Single-letter abbre- 
viations for the amino acid residues are as fol- 
lows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; 
H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; 
Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and 
Y, Tyr. In the mutants, other amino acids were 
substituted at certain locations; for example, 
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D529F indicates that aspartic acid at position 
529 was replaced by phenylalanine.) Asp*”’ makes 
several polar interactions with helix H5 (GIn*” 
and Arg®®°) (Fig. 4A) that are adjacent to the 
hydrophobic core of the a-subdomain (Ile*’, 
Cys°*, Leu”>, Leu", and Ile”**). Phe substitu- 
tion at this site might therefore affect timing of 
local a-subdomain collapse. Consistent with this 
hypothesis, D529F resulted in $7 positioning at 
an earlier stage of synthesis without affecting S6 
(Fig. 4B and fig. S11A). D529F also eliminated 
ribosome dependence for B-sheet core folding 
(Fig. 4C) and suppressed aggregation of full-length 
CFTR caused by Fast553.593) codon substitutions 
(Fig. 4D), thus restoring kinetic coupling between 
o-helical subdomain and f-sheet core. 

This study delineates cotranslational folding 
of a topologically complex protein as a series of 
dynamically modulated folding events that can 
be viewed as a function of chain length (Fig. 4E). 
As the nascent polypeptide emerges from the ri- 
bosome, formation of low FRET-associated open 
conformers is interrupted by discrete intervals of 
peptide compaction. Although each of these fold- 
ing events could be theoretically analogous to in 
vitro folding of an equivalent peptide domain (3), 
this does not appear to be the case. Rather, fold- 
ing occurs in sequential, coupled steps, the tim- 
ing of which is both positively and negatively 
influenced by biosynthetic machinery. Rapid co- 
translational folding of the N-terminal subdo- 
main likely provides a template for subsequent 
interfacial interactions that assist domain assem- 
bly (29). In contrast, optimal folding of the non- 
contiguously synthesized -strands is achieved 
by actively delaying placement of a presynthesized 
N-terminal strand (S6) until C-terminal strands 
(S7 and S8) are available. This process is coor- 
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dinated by maintaining the nascent polypeptide 
in a folding-competent conformation (4, 27) both 
by direct ribosome effects (4, 7) and the transla- 
tion rate as dictated by codon usage (JI, 12). 
Cotranslationally recruited chaperones (30), not 
examined here, may also contribute to the delay 
in S6 placement and a-subdomain collapse (13). 
We refer to this overall process as “translational 
tuning,” in which multiple mechanisms are simulta- 
neously integrated during synthesis to modulate 
intrinsic folding properties of the nascent chain. 
Translational tuning also integrates conserved 
codon usage with biophysical properties imposed 
by amino acid sequence, both of which are tai- 
lored to optimize outcome based on specific fold- 
ing requirements (31, 32). 
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Life-threatening influenza and 
impaired interferon amplification 
in human IRF7 deficiency 
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Severe influenza disease strikes otherwise healthy children and remains unexplained. We 
report compound heterozygous null mutations in /RF7, which encodes the transcription 
factor interferon regulatory factor 7, in an otherwise healthy child who suffered life- 
threatening influenza during primary infection. In response to influenza virus, the patient’s 
leukocytes and plasmacytoid dendritic cells produced very little type | and III interferons 
(IFNs). Moreover, the patient’s dermal fibroblasts and induced pluripotent stem cell 
(iPSC)—derived pulmonary epithelial cells produced reduced amounts of type | IFN and 
displayed increased influenza virus replication. These findings suggest that IRF7-dependent 
amplification of type | and III IFNs is required for protection against primary infection by 
influenza virus in humans. They also show that severe influenza may result from single- 


gene inborn errors of immunity. 


oth seasonal and pandemic influenza viruses 

typically cause self-limiting respiratory dis- 

ease but occasionally cause life-threatening 

acute respiratory distress syndrome (ARDS). 

The frequency of severe disease depends 
on the viral strain (7). Known host risk factors 
to severe influenza consist of a few acquired 
comorbidities, such as chronic pulmonary disease 
(2, 3). The pathogenesis of most cases of life- 
threatening influenza remains unexplained, espe- 
cially among children (4). The lack of severe 
influenza in patients with known primary im- 
munodeficiencies, including inborn errors of T 
and/or B cell immunity that predispose to a 
variety of related infections such as severe para- 
influenza (5), is also unexplained (6). Only hap- 
loinsufficiency for GATA2, resulting in abnormal 
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hematopoietic cell development, was reported in 
a few patients with severe influenza and other 
infections (7). Monogenic inborn errors of immu- 
nity may underlie life-threatening, isolated dis- 
eases in otherwise healthy children during primary 
infection by a few other viruses (8). We therefore 
hypothesized that severe influenza striking oth- 
erwise healthy children may also result from 
single-gene inborn errors of immunity. 

We performed whole-exome sequencing (WES) 
in a 7-year-old girl (“P”), one of 22 individuals 
sequenced (including only three children <5 years 
old) proven to have developed influenza in the 
course of primary infection (table S1). P suf- 
fered life-threatening ARDS during infection with 
laboratory-confirmed pandemic HIN1 (pHIN1) 
2009 influenza A virus ([AV) in January 2011 at 


the age of 2.5 years, prior to any influenza vacci- 
nation. Serum drawn shortly after hospitalization 
showed protective antibody titers against A/ 
Netherlands/602/2009 (HIN1) IAV (fig. S1A) 
but not against A/Perth/16/2009 (H3N2) or B/ 
Brisbane/60/2008 (fig. S1B), indicating that this 
was her first encounter with IAV. She did not 
suffer from severe infections caused by other vi- 
ruses (supplementary case report and fig. S1C). 
The patient had no known comorbidity and no 
detectable immunological abnormalities sugges- 
tive of any T or B cell deficit (table S2). She was 
born to nonconsanguineous parents of French 
descent (Fig. 1A and fig. S2A). WES analysis of the 
trio revealed, and Sanger sequencing confirmed, 
two compound heterozygous JRF7 mutations— 
p.Phe410Val (F410V) and p.Gin421X (Q421TX)—with 
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Fig. 1. Autosomal recessive IRF7 deficiency from A B 

compound heterozygous mutations. (A) Familial ee Wt PP 
segregation of IRF7 mutations in a nonconsangui- | eo 
neous French family. (B) Schematic illustration of ; ee esse Ee = 5 sc 
IRF7A featuring DNA binding domain (DBD), 

constitutive activation domain (CAD), virus-activated 

domain (VAD), inhibitory domain (ID), and signal 

response domain (SRD). A potential nuclear local- Cc 

ization signal (NLS) lies between amino acids 417 004 CI IFNB UI & os a gleee ony Me eeiay ony 

and 440, and the nuclear export signal (NES) Soi = a eeang ne owe a iin 

between amino acids 448 and 462. Phosphorylation 3 6004 Ea IFNAG Sev P-IRF7 

sites (P) Ser4”” and Ser*’? lie at the C terminus. 2 = IFNAG UI ene 

Mutations are shown in red. (C) Wild-type, F410V, 3 °°) mam IFNae sev eae —— 

or Q421X IRF7 activation of IFNB, IFNA4, or 3 ——- 

IFNA6 promoter-driven reporter assay. Cells are 3 TBI - 
uninfected (UI) or infected with SeV. Means + SD | 

of three independent experiments are shown. ene a ae Ge GAPDH | ee —_ 


*P < 0.01, **P < 0.005, ***P < 0.001 as 
determined by t test. (D) Phosphorylation of 
HA-tagged wild-type (WT), F410V, or Q421X IRF7 coexpressed with FLAG-tagged TBK1 as assessed by Western blot with phospho-specific IRF7 
antibody (P-IRF7); GAPDH was used as a loading control. This result is representative of two experiments. 
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Fig. 2. P’s IRF7 alleles are loss-of-function by different mechanisms. (A to C) Localization of FLAG-tagged wild-type (A), F410V (B), or Q421X (C) IRF7 in 
uninfected or Sendai virus-infected Vero cells by immunofluorescence imaging. This result is representative of two experiments. (D) Wild-type and 
mutant IRF7 dimerization by immunoprecipitation (IP) with antibody to FLAG followed by Western blot with antibodies to FLAG and HA. WCL, whole-cell 
lysate. (E) Localization of FLAG F410V (top) and FLAG wild-type IRF7 (bottom) cotransfected with HA Q421X IRF7 in Vero cells as assessed by immuno- 
fluorescence imaging. This result is representative of two experiments. 
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each parent being heterozygous for a single mu- 
tated allele (Fig. 1A and fig. S2B), which defined 
the best candidate genotype in this patient (9). 
Interferon regulatory factor 7 (IRF7) is a tran- 
scription factor that amplifies interferon (IFN) 
production in response to viruses (J0-12). Spe- 
cifically, IRF7 is involved in the amplification of 
mouse and human type I (13 JENA, IFNB, IFNE, 
IFNK, IFNW) and type III (IL29, IL28A, IL28B) 
IFN genes (11-13). The missense F410V substitu- 
tion is predicted to be damaging and absent in 
public databases. The nonsense Q421X (Fig. 1B 
and fig. S2B) is found as a heterozygous variant 
in 1 out of 118,062 chromosomes of the Exome 
Aggregation Consortium (ExAC) cohort, yielding 
a minor allele frequency of 0.000008. There are 
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currently no homozygous or compound hetero- 
zygous nonsynonymous mutations found in our 
in-house (table S3) and public databases (ExAC). 
Thus, autosomal recessive IRF7 deficiency by com- 
pound heterozygosity may underlie severe influ- 
enza in this child. 

Each mutation was loss-of-function in reporter 
assays driven by IFNB, IFNA4, or IFNA6 promoters 
(Fig. 1C) (1D). None of the other five heterozygous 
IRF7 variants tested was loss-of-function (table 
S4 and fig. $3). IRF7 can be activated by RIG-I 
recognition of IAV genomic RNA, resulting in 
C-terminal serine phosphorylation by the IKK- 
related kinases TBK1 and IKK-e (JI, 14-20). Wild- 
type IRF7 was phosphorylated upon overexpression 
of TBK1, as was F410V (Fig. 1D). The truncated 
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Q421X product lacks the C-terminal serine resi- 
dues and was not phosphorylated (Fig. 1D). Phos- 
phorylation induces IRF7 nuclear accumulation 
and transcription of type I and III IFNs (7, 14-16). 
Wild-type IRF7 accumulated in the nuclei of all 
transfected Vero cells by 8 hours post-infection 
(hpi) with Sendai virus (SeV) (Fig. 2A), whereas 
F410V was cytoplasmic (Fig. 2B) and Q421X was 
nuclear with or without SeV infection (Fig. 2C). 
This suggested that F410V disrupts a potential 
nuclear localization signal (amino acids 417 to 
440) and was consistent with Q421X missing the 
nuclear export signal (amino acids 448 to 462) (15). 

The two mutant IRF7 alleles led to loss of 
function by different mechanisms: F410V did not 
accumulate in the nucleus despite phosphorylation, 


f/f 
Ocscteat 


f 
SAMD4A / — IGF2BP3 


Co of 5 
ef 


NUPR1 


fo} 
3 
i=} 


ac 
A UNC-93B-4 
: g100 eee 
A a s 
rN a 2 10 
a 2 
2 
r a4 
aA 
0.1 
pHiNi HSV-1 pHINt HSV-1 


uninfected pH1N1 


C1 ° . 
c2 a . 
C3 a a 
C4 Vv vy 
C5 ° . 
iP ° + 
mother o e 
father o . 


N 
v 
2 


Fig. 3. Impaired IRF7-dependent innate immunity in leukocytes and 
pDCs. (A) The top 5% of genes, in PBMCs infected with pHIN1 at MOI 
(multiplicity of infection) = 2, whose relative changes were >2 (up or down) in 
controls and <1 in the patient, P, as analyzed by gene array. (B) Causal 
network analysis of the differentially regulated genes. (C) IFN-a production at 
24 hpi with pH1N1 IAV or HSV-1 in pDCs from four healthy controls, P, and an 
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UNC-93B-deficient individual (UNC-93B~), a control for TLR responses. 
(D) MX1 and /IL8 were measured in cells from (C) by qPCR. (E and F) The 
expression of all indicated type | IFN genes was measured by qPCR in purified 
pDCs (E) and unsorted PBMCs (F) separately infected with pH1N1 IAV at MOI = 
1. The probe for IFNA13 also detects IFNA21 mRNA. All data shown in (C) to (F) 
are representative of two independent experiments. 
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whereas Q421X resided in the nucleus without 
phosphorylation. The mutant IRF7 proteins could 
homo- or heterodimerize (Fig. 2D), which sug- 
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Fig. 4. IRF7-dependent intrinsic immunity is required for control of [AV 
infection. (A) IRF7 mRNA induction at indicated time points after IFN-p 
treatment in F-SV40. Means + SD of three replicates are shown. (B) IRF7 ex- 
pression in F-SV40 measured by Western blot 18 hours after treatment with IFN-B. 
(C) Virus titers in F-SV40 from P stably transfected luciferase (Luc) or wild-type 
IRF7 with an internal ribosome entry site—expressed red fluorescent protein, 
after infection with pHIN1 (MOI = 10) or VSV (MOI = 3). Means + SD for pHIN1 
IAV (n = 3) and VSV (n = 7) are shown. ***P < 0.001 between controls and P as 
determined by t test. (D) IRF7 expression in IFN-B- or IFN-A-treated PECs 
derived from healthy control ESCs (RUES2), SV iPSCs, or three individual clones 
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gests that coexpression might enable nuclear 
unphosphorylated Q421X to shuttle phosphoryl- 
ated cytoplasmic F410V to the nucleus, where 


IRF7 mutant heterodimers could up-regulate IFNs. 
However, there was no nuclear relocalization of 
F410V or the wild type in the presence of Q421X 
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(clones 1, 2, and 3) of P's iPSCs as detected by Western blot. (E) IFN-f 
production in PECs infected with A/PR/8/34-GFP as measured by ELISA at 
indicated time points. Means + SD of two independent experiments are shown. 
*P < 0.05 as determined by t test. (F) Staining of IAV nucleoprotein (NP) (green) 
and Nkx2.1 (red) in PECs derived from SV-iPSC control and P infected at MOI = 
1 with pHIN1 IAV. Cells derived from a single representative clone of P’s iPSCs 
are shown. (G) Percentage of Nkx2.1* cells scored as positive for [AV NP. in PECs 
untreated or treated with IFN-o (100 or 1000 U/ml) for 18 hours and infected 
with pHIN1 for 24 hours. Means + SD of two independent experiments are 
shown. *P <1 x 10° as determined by x? analysis. 
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(Fig. 2E). Consistently, there was no rescue in re- 
porter assays when the two mutants were coex- 
pressed (fig. S4A). Moreover, neither allele was 
dominant negative in terms of IRF7 or IRF3 func- 
tion (fig. $4, B to D). This is consistent with the 
lack of infectious phenotype, including severe in- 
fluenza, in P’s heterozygous parents. Overall, the 
patient’s severe influenza may result from lack of 
functional IRF7 homodimers, or IRF3-IRF7 het- 
erodimers, or both. 

We investigated the genome-wide impact of 
IAV infection in peripheral blood mononuclear 
cells (PBMCs) by microarray, quantitative poly- 
merase chain reaction (qPCR), and enzyme-linked 
immunosorbent assay (ELISA). At baseline, PBMCs 
from P displayed significant down-regulation of 
innate immune genes (fig. S5). We observed a 
robust induction of type I IFN genes JFNAI4, 
-16, -7, -2, -10, -13, and -21 (in order of relative 
change); JFNE; and [FNW1, as well as type III 
IFN genes JZ29 and JL28A, in healthy donors 
but not in P, at 8 and 16 hpi (Fig. 3A). Among 
IFN genes, the only exception was a factor of 2 
induction of IFNB in P, albeit this was less than 
in controls by a factor of 4. Ingenuity pathway 
analysis software independently predicted IRF7 
as an upstream regulator of these genes (P = 
2.02 x 107") (Fig. 3B). IFN-stimulated genes 
(ISGs) known to inhibit IAV replication—MX7 
(21), RSAD2 (22), BST2 (23), and SERPINE] (24)— 
were up-regulated normally in P, perhaps via 
stimulation by IFN-B. Her PBMCs displayed a 
profound or complete defect of IFN-a.2 produc- 
tion after infection with 11 other viruses or stim- 
ulation with Toll-like receptor (TLR) agonists (fig. 
S6). IFN-B and IFN-A1 levels were impaired but 
less so than IFN-a, whereas IL-6 production was 
normal (fig. S6, B to D). Overall, we observed an 
overwhelming and selective defect of type I and 
III IFN induction in P’s PBMCs. 

In mice and humans, IRF7 is constitutively ex- 
pressed in plasmacytoid dendritic cells (pDCs), 
resulting in pDCs being the major type I IFN- 
producing cells (25, 26). We measured IFN-o2 
production in response to pH1N1 infection in P’s 
pDCs (Fig. 3C), which were found at normal fre- 
quency (table S5). We observed no IFN-a2 pro- 
duction in pDCs from P at 24 hpi with pHIN1 
and herpes simplex virus-1 (HSV-1) (Fig. 3C). 
MX1 induction was abolished, while that of IL8 
was normal (Fig. 3D). The patient’s heterozygous 
mother produced IFN-a2 like controls (fig. S7A). 
We quantified the induction of all 20 human 
type I and type III IFN genes by qPCR in purified 
pDCs and unsorted PBMCs after infection with 
pHINI (Fig. 3, E and F). At 8 hpi, P was deficient 
for type I IFN (including the 13 IFNA, IFNE, 
IFNK, and IFNW) and type III IFN (/L29, IL28A, 
and JZ28B) genes in both cell preparations (Fig. 
3, E and F, and fig. S7B). IFNB was, however, 
mildly induced in P’s pDCs (Fig. 3E), and the in- 
duction of MX7 in P at 8 hpi was normal (fig. S7C). 
Collectively, a small amount of IRF7-independent 
IFN-f triggered early ISG up-regulation at 8 hpi 
in pDCs, but the IRF7-dependent amplification of 
type I and III IFNs was lacking for sustained ISG 
induction at 24 hpi. 
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We investigated the impact of IRF7 deficiency 
on cell-intrinsic, nonhematopoietic immunity, 
using P’s SV40-immortalized fibroblasts (F-SV40). 
Basal and IFN-B-induced JRF7 mRNA expression 
in F-SV40 from P was normal (Fig. 4A). In con- 
trast, IRF7 protein expression was diminished, 
even after IFN-B treatment (Fig. 4B). We observed 
approximately 2-log higher titers of IAV at 48 hpi 
relative to healthy controls (Fig. 4C). Stable trans- 
fection of wild-type IRF7 complemented this phe- 
notype (Fig. 4C). Wild-type IRF7 similarly rescued 
enhanced replication of vesicular stomatitis virus 
(VSV) (Fig. 4C). Highly pathogenic avian H5N1 
IAV also replicated to high titers in P’s F-SV40, 
indicating that the phenotype was not IAV strain- 
specific (fig. S8A). Further, treatment with exoge- 
nous IFN-a2b protected the fibroblasts from 
IAV and VSV replication (fig. S8, B and C). IFN-B 
production by fibroblasts was impaired after 
IAV infection (fig. S8D); however, it was normal 
after stimulation with extracellular or intracel- 
lular polyinosine-polycytidine (synthetic double- 
stranded RNA) (fig. S8E). This is consistent with 
intact IRF3-dependent signaling (fig. S4D) and de- 
tectable IFN-B production in P’s PBMCs (fig. S6A). 

IAVs first target the entire respiratory tract, 
with pHIN1 2009 viral antigen present in type I 
and type II pneumocytes in humans (2, 27). We 
generated patient-specific pulmonary epithelial 
cells (PECs) from induced pluripotent stem cells 
(iPSCs) derived from P’s primary fibroblasts (28). 
IAV replication and IFN-B induction were com- 
pared in PECs derived from an embryonic stem 
cell line (RUES2), a SeV-reprogrammed healthy 
control iPSC line (SV-iPSC), and three indepen- 
dent IRF7-deficient iPSC clones. IRF7 expression 
in response to IFN-o, IFN-B, and IFN-A1 treat- 
ment and IFN-f production in response to IAV 
were impaired in P lines (Fig. 4, D and E). Overall 
titers of IAV appeared equal regardless of JRF7 
genotype, largely because of efficient replication 
in cells negative for Nkx2.1, a pulmonary epi- 
thelium marker in these cultures. However, when 
we scored the infected Nkx2.1* cells for IAV nu- 
cleoprotein (NP) antigen (Fig. 4F), 52.4% of P’s 
PECs were double positive, versus 27.2% in con- 
trols (Fig. 4G). This phenotype was rescued by 
treatment with IFN-a2b (Fig. 4G), IFN-f, or 
IFN-A1 (fig. S9). These data suggest that impaired 
intrinsic immunity in the pulmonary epithelium 
may have contributed to P’s ARDS. Impaired IFN 
production by P’s pDCs may also have caused 
disease, as evidenced by the pulmonary cell res- 
cue with exogenous IFN-o.2b. 

A single-gene inborn error of IRF7 immunity 
can underlie life-threatening, isolated influenza 
in humans during primary infection, broaden- 
ing the range of human infections that result 
from genetic lesions (8, 9, 29, 30). IRF7 deficiency 
disrupts the main function of pDCs, the produc- 
tion of antiviral IFNs. This is distinct from 
deficiencies of IRF8 and GATA2 that impair 
development of all circulating monocytes and 
DCs and of multiple myeloid and lymphoid 
subsets, respectively (7, 31, 32). IRF7 deficiency 
also affects cell-autonomous, intrinsic immunity 
in PECs. The lack of IRF7-dependent type I and 


III IFN amplification by pDCs, PECs, or possibly 
other cell types likely underlies the patient’s severe 
influenza. Interestingly, IRF7 is redundant for 
vaccine-mediated immunity to influenza viruses, 
as the child has been healthy for 5 years with an- 
nual influenza vaccination as the sole secondary 
prevention. The IRF7-deficient child, now 7 years 
old, also displays a narrow infectious phenotype, 
restricted to severe influenza, at odds with the 
broad role of mouse IRF7 in antiviral immunity 
(0, 30). Human IRF7 seems to be largely redun- 
dant in host defense against viruses. It will be im- 
portant to search for deficiencies in JRF7 and 
related genes in children with influenza and 
other severe viral illnesses. Our study provides 
proof of principle that single-gene inborn errors 
of immunity can cause severe childhood influenza. 
IFN-based, patient-tailored therapeutic strategies 
could be helpful in life-threatening influenza of 
childhood (33, 34). 
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NEUROBIOLOGY 


SARMI activation triggers 
axon degeneration locally via 


NAD’ destruction 


Josiah Gerdts,' E.J. Brace,” Yo Sasaki,’ Aaron DiAntonio,””® Jeffrey Milbrandt*”* 


Axon degeneration is an intrinsic self-destruction program that underlies axon loss 
during injury and disease. Sterile alpha and TIR motif—containing 1 (SARM1) protein is an 
essential mediator of axon degeneration. We report that SARM1 initiates a local destruction 
program involving rapid breakdown of nicotinamide adenine dinucleotide (NAD*) after 
injury. We used an engineered protease-sensitized SARM1 to demonstrate that SARM1 
activity is required after axon injury to induce axon degeneration. Dimerization of the 
Toll-interleukin receptor (TIR) domain of SARM1 alone was sufficient to induce locally 
mediated axon degeneration. Formation of the SARM1 TIR dimer triggered rapid 
breakdown of NAD*, whereas SARM1-induced axon destruction could be counteracted by 
increased NAD* synthesis. SARM1-induced depletion of NAD* may explain the potent 
axon protection in Wallerian degeneration slow (Wld*) mutant mice. 


ells undergo regulated self-destruction dur- 
ing development and in response to stresses 
(1). Axons, the longest cellular structures 
in the body, have a locally mediated self- 
destruction program that removes dam- 
aged axons but also promotes axon loss in the 
setting of neurological disorders (2). Axon degen- 
eration is antagonized by the Wallerian degen- 
eration slow (Wld°) chimeric protein (3). The 
active moiety of Wld° is the enzyme nicotinamide 
mononucleotide adenyltransferase 1 (Nmnatl), 
which synthesizes the essential cofactor nicotin- 
amide adenine dinucleotide (NAD*) (4), but the 
function of Nmnatl and NAD* in axon protec- 
tion remains unclear (2). The protein SARM1 
(sterile alpha and TIR motif-containing 1; TIR, 
Toll-interleukin receptor) is an essential mediator 
of axon degeneration (5, 6). SARM1 is a negative 
regulator of Toll-like receptor-activated transcrip- 
tional programs (7), but its mechanism for axon 
degeneration is unknown. 
To investigate whether SARM1 functions be- 
fore or after injury, we engineered a system to 
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inactivate SARM1 with pharmacologic control. 
Protease-sensitized SARM1 (SARM?*) contains 
a tobacco etch virus (TEV) protease consensus 
sequence between the sterile alpha motif (SAM) 
and TIR domains, which are both essential for 
SARM1 function (6). SARMP* is thus cleaved and 
inactivated by TEV protease. SARM?® was fused 
to the rapamycin-binding domain Frb and the 
N-terminal portion of split TEV protease (Ntev) 
(8) and coexpressed with C-terminal split TEV 
fused to FK866 binding protein (Fkbp-Ctev), al- 
lowing rapamycin-induced cleavage (Fig. 1A and 
fig. S1). In dorsal root ganglion (DRG) neurons, 
cleavage of SARM?® was mostly complete within 
60 min of rapamycin treatment (Fig. 1B and 
fig. S2A). SARM?® functionality was verified by 
expression of SARM?® in isolated Sarm1~/~ DRG 
neurons. When Sarm17~ axons were severed (dia- 
grammed in Fig. 1C), they remained intact after 
24 hours, whereas axons of neurons expressing 
SARMP* showed degeneration measured by axon 
morphometry (Fig. 1D), similar to wild-type axons. 
SARM?® function was lost upon cleavage trig- 
gered by rapamycin in the presence of Fkbp-Ctev 
(Fig. 1, D and E) or by expression of full-length 
TEV (fig. S2B). Cleavage of SARM?*® initiated 
12 hours before or up to 2 hours after axon tran- 
section fully suppressed axon degeneration mea- 
sured 24 hours after axotomy. Because cleavage 
of SARM?® after axons were disconnected from 


cell bodies resulted in protection, SARM1 must 
function after injury to promote degeneration. 

SARM1 has no predicted enzymatic function 
but contains a TIR domain, which is the effector 
domain of Toll-like receptors (TLRs). Activation 
of TLRs results in dimerization of TIR domains 
that transmit a signal to cytosolic effector pro- 
teins (9). We tested whether multimerization of 
the TIR domain of SARM1 (sTIR) might induce 
axon degeneration. A minimal region of human 
SARMI comprising sTIR and the adjacent multi- 
merization (SAM) domains, but lacking the auto- 
inhibitory N terminus (SAM-TIR), is constitutively 
active and promotes cell and axon destruction 
in cultured DRG neurons (6). Expression of this 
activated form of SARM1 in vivo in Drosophila 
motor (Fig. 2A) or sensory neurons (fig. S3) also 
caused cell and axon destruction. This degener- 
ation was not observed in Drosophila expressing 
SAM-TIR harboring a disruptive sTIR mutation. 

To evaluate the sufficiency of sTIR dimeriza- 
tion in axon destruction, we engineered a phar- 
macologically controlled dimerizable sTIR by 
fusing it to the rapamycin-binding domains Frb 
and Fkbp (Fig. 2B) (10). We expressed Frb-sTIR 
and Fkbp-sTIR in DRG neurons and found that 
sTIR dimerization by rapamycin induced axon 
fragmentation within 12 hours (Fig. 2C) and neu- 
ronal cell death within 24 hours (Fig. 2D). sTIR- 
induced toxicity did not require the inhibition of 
mammalian target of rapamycin (mTOR), because 
the rapamycin analog AP20187, which does not 
target mTOR, also stimulated axon degeneration 
in cells expressing the homodimerizable Fkbp***Y- 
sTIR (10). SARM1 activation is thus sufficient to 
elicit axonal and neuronal destruction. Cell and 
axon degeneration were not induced upon di- 
merization of the TIR domains of TLR4 or the 
adaptor MYD88 (Fig. 2E). 

We tested whether SARM1 promotes axon de- 
generation through a local mechanism. sTIR- 
induced degeneration does not require a physical 
connection between the axon and soma: Sarm1/~ 
axons persisted after severing; however, sTIR 
dimerization by AP20187 caused fragmentation 
of these severed segments within 12 hours (Fig. 
2F). Dimerization of sTIR locally within axons 
also led to selective axon destruction. We grew 
DRG neurons in adjacent fluid compartments: 
one containing the cell bodies and proximal axons 
and the other containing only distal axons (Fig. 
2G). Application of AP20187 to both compart- 
ments led to destruction of proximal and distal 
axons, whereas selective application to the distal 
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Fig. 2. Axon degeneration and neuronal death induced by sTIR dimerization. (A) Micro- 
graph showing motor nerves of third-instar Drosophila larvae. M12-Gal4 drives expression from 
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for SARM1 activity after axotomy to induce axon degeneration. Axon degenera- 
tion is reported as the degeneration index (Dl), a morphometric ratio of frag- 
mented axon area to total axon area (13). Sarml-’~ DRG neurons treated with 
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and treated with 100 nM rapamycin at various times (pre = 12 hours pre-injury). 
(E) Micrographs show representative a-tubulin—stained axons corresponding to 
select treatment groups in (D). Scale bar, 50 um. Error bars, SEM; *P < 0.01; 
one-way analysis of variance (ANOVA) with Tukey's post-hoc test. 
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neuronal viability quantified by ethidium homodimer exclusion after 24 hours. (E) Effects of dimerization of sTIR or TIR domains of MYD88 or TLR4 on axon 
degeneration. (F) Effects of sTIR dimerization on degeneration of Sarm1”~ axons physically disconnected from cell bodies. (G) (Left) Diagram of axons growing 
through a diffusion barrier into an isolated fluid compartment. (Right) Micrographs of isolated distal axon segments after application of AP20187 globally or 
selectively to distal axons. Scale bar, 50 um. Error bars, SEM; *P < 0.01; one-way ANOVA with Tukey’s post-hoc test. 
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chamber elicited selective distal axon degenera- 
tion after 24 hours (Fig. 2G). 

SARMI TIR dimerization elicited rapid patho- 
physiologic changes: Axon degeneration and neu- 
ronal death were evident within 1.5 and 6 hours, 
respectively (fig. S4, A and B), and neuronal mi- 
tochondrial membrane potential dissipated and 
calcium accumulated with similar kinetics (fig. S4, 
Cto E). These measurements indicate early ener- 
getic failure. We thus focused on biochemical 
events leading from SARM1 activation to axonal 
demise. Axon degeneration is antagonized by the 
NAD* synthetic enzyme Nmnatl, which, like SARM1, 
functions locally within axons (11). Injured axons 
exhibit declining levels of NAD* before morpho- 
logic changes (12), but it is unknown whether 
this is a cause or consequence of axon destruction. 
Although Wld*/Nmnat1 does not increase the 
steady-state abundance of NAD* (13), in the 
setting of acute NAD* depletion it might main- 
tain sufficient levels of NAD* for viability (dia- 
grammed in Fig. 3A). We thus tested whether 
SARM1 activation leads to depletion of NAD*. 

To test whether endogenous SARM1is necessary 
for axonal loss of NAD* after axotomy, we iso- 
lated axons from cultured wild-type and Sarm1~ 
DRG neurons 3 and 4 hours after injury, a time 
when they remain morphologically intact, and 
measured the abundance of NAD* using high- 


performance liquid chromatography (HPLC). The 
abundance of NAD* decreased after injury in 
wild-type axons but remained stable in Sarm17/~ 
axons (Fig. 3B). Loss of adenosine triphosphate 
(ATP), an expected consequence of NAD* deple- 
tion, was also SARM1-dependent (fig. S5A). To 
determine whether SARM1 is also necessary for 
axotomy-induced loss of NAD* in vivo, we com- 
pared concentrations of NAD* in distal sciatic nerve 
segments from adult wild-type and Sarm1/~ mice. 
At 30 hours after injury, amounts of NAD* were 
decreased in wild-type nerves but remained stable 
in Sarm1~ nerves (Fig. 3C). At this time, injured 
nerves remained morphologically intact (fig. S5C), 
and amounts of ATP were stable (fig. S5B). 

We tested whether SARM1 activation was suf- 
ficient to elicit loss of NAD* by measuring neuronal 
NAD* after sTIR dimerization. sTIR dimeriza- 
tion by the addition of AP20187 caused rapid loss 
of NAD*; within 15 min, the abundance of NAD* 
was reduced by 66%, and by 90 min, 90% of the 
NAD* was lost (Fig. 3D). The abundance of ATP 
also declined after sTIR dimerization, but its de- 
pletion was slower than that of NAD*. 

Together these data implicate NAD* loss as a 
critical step in SARM1-mediated axon destruc- 
tion. We therefore examined whether increased 
NAD* synthesis could counteract the destruction 


program activated by sTIR dimerization. In DRG 


neurons, both axon degeneration and cell death 
initiated by sTIR dimerization were completely 
blocked by the expression of Nmnatl and nico- 
tinamide phosphoribosyltransferase (Nampt), 
which together synthesize NAD* (Fig. 3A). Pro- 
tection appeared to require NAD* synthesis, be- 
cause concurrent treatment with the Nampt 
inhibitor FK866 blocked the protection afforded 
by these enzymes (Fig. 3, E and F). Similarly, 
sTIR-induced axon degeneration and cell death 
were blocked by supplementation with the cell- 
permeant NAD* precursor nicotinamide riboside 
(NR) (Fig. 3, G and H) (14). Drosophila larvae 
expressing the dimerizable Fkbp'*°Y-sTIR in motor 
neurons that were fed AP20187 showed extensive 
axon degeneration that was blocked by coexpres- 
sion of cytosolic Nmnatl (Fig. 31D. 

To extend our analysis of biochemical events 
after SARM1 activation, we created a heterolo- 
gous human embryonic kidney (HEK293T) cell 
line (HTir) that stably expresses Frb-sTIR and 
Fkbp-sTIR. After 12 hours of sTIR dimerization 
in HTir cells, toxicity was evident, as indicated by 
the loss of ATP (fig. S6A) and altered morphology 
(fig. S6B). Both effects were blocked by NR sup- 
plementation. Inhibition of NAD* synthesis with 
FK866 increased the loss of ATP, whereas FK866 
was not toxic in the absence of sTIR dimerization 
(fig. S6, A and B). 
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Fig. 3. Loss of NAD* underlies SARM1-induced destruction. (A) Diagram of 
NAD* synthesis and inhibition by FK866; Nrk, nicotinamide riboside kinase; 
NMN, nicotinamide mononucleotide. (B) Axonal NAD* concentration in cul- 
tured wild-type and Sarml-’~ DRG neurons after axotomy; normalized to wild- 
type control. (©) NAD* concentration in distal sciatic nerve segments from 
wild-type or Sarml”~ animals after transection; wild-type n = 5; Sarml-”~ n = 9. 
(D) Neuronal NAD* and ATP concentrations after sTIR dimerization by AP20187; 
comparisons are made to O min control. (E and F) Axon degeneration (E) and 
neuronal cell death (F) induced by sTIR homodimerization (AP20187) and 
inhibition by NAD* synthetic enzymes with or without the Nampt inhibitor 


SCIENCE sciencemag.org 


FK866 (10 nM); measured 24 hours after sTIR dimerization and FK866 
application. (G@ and H) Effect of NR (1 mM) on axon degeneration (G) and 
neuronal cell death (H) induced by sTIR homodimerization (AP20187) for 
24 hours with or without NR. (1) Micrographs showing sTIR-induced motor 
axon fragmentation in third-instar Drosophila larvae blocked by cytosolic 
Nmnatl (cytNmnatl) expression. M12-Gal4 drives expression from UAS- 
mCD8-GFP (green) and UAS-FkFS®YTIR with or without UAS-cytNmnat! in 
single motor axons in each nerve (red, HRP). Degeneration score = 76 + 4% 
(control) versus 11 + 2% (cytNmnatl); P < 0.001 (t test); scale bar, 20 um. Error 
bars, SEM; *P < 0.01; one-way ANOVA with Tukey's post-hoc test. 
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To evaluate whether NAD* depletion alone is 
sufficient to induce axon destruction, we stimu- 
lated direct intracellular breakdown of NAD* by 
dimerization of the poly-ADP-ribose polymerase 
(PARP) domain of Tankyrase 1 (Inkp; diagrammed 
in Fig. 4A). We generated dimerizable Fkbp***’- 
Tnkp and showed that AP20187 treatment of cells 
expressing this construct led to loss of NAD* and 
formation of PAR (fig. S7, A and B). In the pres- 
ence of FK866, which inhibits de novo NAD* 
synthesis, Tnkp dimerization in dividing cells 
led to rapid energetic failure (ATP loss) that was 
blocked by the Tankyrase inhibitor XAV939 (fig. 
S7C). NR supplementation blocked toxicity but 
not PAR formation, indicating that NAD* loss 
and not PAR formation caused cell death (fig. $7, 
A and C). In neurons, Tnkp-induced depletion of 
NAD* caused degeneration of uninjured wild- 
type and Sarm1/~ axons (Fig. 4, B and C). More- 
over, NAD* depletion from isolated (presevered) 
Sarm17’~ axons led to degeneration (Fig. 4C). Thus, 
rapid NAD* depletion is sufficient to cause rapid 
axon loss. 


To define whether SARM1-mediated depletion 
of NAD* results from increased consumption 
or decreased synthesis of NAD*, we introduced 
exogenous NAD” and, as a control, nicotinic acid 
adenine dinucleotide (NaAD) into HTir cells by 
electroporation (15), followed by sTIR dimeriza- 
tion. Control cells showed rapid loss of endoge- 
nous NAD* within 5 min in response to sTIR 
dimerization. Electroporation in the presence 
of NAD* increased the concentration of NAD* 
by a factor of 4.3, but NAD* was rapidly con- 
sumed upon sTIR dimerization. The specificity 
of this reaction is highlighted by the stability 
of the closely related analog NaAD (Fig. 4D). 
sTIR-induced loss of NAD* thus involves the 
active consumption of NAD*. We next demon- 
strated that the consumed NAD* is converted 
to nicotinamide (Nam). When radiolabeled “C- 
NAD* was introduced into cells, 15 min of sTIR 
dimerization elicited loss of “*C-NAD* and con- 
comitant increases in “C Nam as detected by 
thin-layer chromatography (Fig. 4E). Similarly, 
sTIR dimerization in nonelectroporated cells 
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Fig. 4. Effects of NAD* breakdown on axon degeneration. (A) Diagram of NAD* manipulation using 
Tnkp dimerization. NAD* loss induced by FkbpF36V-Inkp dimerization is blocked by Tankyrase inhibitor 
XAV939 or NR and is exacerbated by FK866. (B) Axon degeneration in response to NAD* depletion By 


dimerized Tnkp and FK&866 after 24 hours (bar d) and inhibition by Tankyrase inhibitor XAV939 (100 nM; 
bar e). Representative a-tubulin—stained axons corresponding to bars b and d are shown; scale bar, 50 um. 
(C) Effect of NAD* depletion by dimerized Tnkp + FK866 on axon degeneration in Sarml~ uninjured axons 
(bar b) or isolated (cut) Sarml’~ axons (bar d). (D) Effect of sTIR dimerization on endogenous (dotted 
lines) and exogenously introduced (solid lines) NAD* or NaAD (control) in HTir cells. NaAD is undetectable 
in nonelectroporated cells. (E) Conversion of “C-NAD* in HTir cells to Nam 15 min after SARM1 TIR 
dimerization. NAD* and Nam from cell extracts and extracellular media were resolved by thin-layer 
chromatography. (F) (Top) Effect of the PARP inhibitor olaparib (100 nM) on NAD* loss induced by 1 mM 


H2O0> (10 min) or sTIR dimerization (10 min) in HTir cells. (Bottom) PAR formation after H2O2 treatment 
or sTIR dimerization in HTir cells expressing PARG shRNA and inhibition by olaparib. Error bars, SEM; 
*P < 0.01; one way ANOVA with Tukey’s post-hoc test. 
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also elicited Nam release as detected by HPLC 
(fig. S8). 

Rapid breakdown of NAD* induced by SARM1 
TIR is similar to that observed when PARP is 
activated in response to DNA damage (16). How- 
ever, NAD* breakdown induced by sTIR is PARP- 
independent. The PARP inhibitor olaparib reduced 
NAD* loss induced by H.O,, but had no effect on 
SARM1-induced loss of NAD* (Fig. 4F). Further- 
more, H2O, led to PARP-dependent accumula- 
tion of PAR, whereas no PAR was detected after 
sTIR dimerization (Fig. 4F). Finally, sTIR dimeri- 
zation in Parp1~ cells induced loss of NAD*, axon 
degeneration, and cell death (fig. S9). These cell- 
destruction phenotypes were also unaffected by 
genetic ablation of the NAD* glycohydrolase CD38, 
another major consumer of NAD* (J7) (fig. S9). 
SARM1 therefore initiates an NAD* breakdown 
program that drives axon destruction and cell 
death independently of PARP1 and CD38. 

SARM1 and its orthologs promote axonal de- 
generation (5, 6) as well as neuronal (18-20) and 
non-neuronal (21, 22) cell death. SARM1-induced 
breakdown of NAD* links axon degeneration to 
the axon-protective Wld* protein. The presence 
of Wld* or other sources of axonal Nmnat may 
allow for rapid resynthesis of NAD* and the mainte- 
nance of metabolic function, thereby counter- 
acting the destructive effects of NAD* degradation 
by SARM1. Identification of a class of neuro- 
protective drugs that increase NAD* biosynthesis 
through effects on Nampt has highlighted the 
therapeutic potential of augmented NAD* syn- 
thesis in neurological disorders (23). Our study 
provides further biological rationale for NAD* 
augmentation as a therapeutic approach. Inhibi- 
tion of SARM1-mediated NAD* loss may be an 
alternative or synergistic therapeutic strategy for 
the treatment of neurologic disorders. 
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RIBOSOME 


Mechanical force releases nascent 
chain-mediated ribosome arrest 


in vitro and in vivo 


Daniel H. Goldman, Christian M. Kaiser,”?*+ Anthony Milin,”? Maurizio Righini,” 


Ignacio Tinoco Jr.,' Carlos Bustamante 


1,245,657 + 


Protein synthesis rates can affect gene expression and the folding and activity of the 
translation product. Interactions between the nascent polypeptide and the ribosome exit 
tunnel represent one mode of regulating synthesis rates. The SecM protein arrests its own 
translation, and release of arrest at the translocon has been proposed to occur by mechanical 
force. Using optical tweezers, we demonstrate that arrest of SecM-stalled ribosomes can 
indeed be rescued by force alone and that the force needed to release stalling can be 
generated in vivo by a nascent chain folding near the ribosome tunnel exit. We formulate a 
kinetic model describing how a protein can regulate its own synthesis by the force generated 
during folding, tuning ribosome activity to structure acquisition by a nascent polypeptide. 


he ribosome translates mRNA into amino 

acid sequences that contain the informa- 

tion needed for the polypeptide to attain 

its native structure. Differential usage of 

synonymous codons and structural ele- 
ments in the mRNA modulate polypeptide 
elongation rates. Such rate variations may be 
required for proper folding and processing of 
nascent proteins (7). Moreover, interactions of 
specific nascent chain sequences (2, 3) with 
the ribosome exit tunnel (4) result in reduced 
rates of elongation. The bacterial SecM protein 
represents an example of a stalling sequence 
that interacts with the ribosome exit tunnel and 
allosterically represses the peptidyl transferase 
activity of the ribosome (4-7). Translation of 
SecM regulates expression of SecA, the motor 
component of the bacterial Sec translocon (2). 
Release of stalling in vivo requires interactions be- 
tween nascent SecM and the translocon machin- 
ery (8, 9). It has been suggested that mechanical 
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force exerted by the translocon relieves elongation 
arrest and leads to translation restart (10). 

To investigate the effect of force on the re- 
lease of SecM-stalled ribosome-nascent chains 
(RNCs), we adapted a single-molecule optical 
tweezers assay (12) (Fig. 1A), enabling the ap- 
plication of defined forces to single ribosome- 
associated nascent polypeptides. We generated 
stalled RNCs that contained the C-terminal 
domain of human calmodulin (CaM) (figs. S1 
and $2). CaM provides a mechanical fingerprint 
(12) in our experiments by exhibiting equilib- 
rium folding and unfolding (“hopping”) at ~7 pN 
(Fig. IB and supplementary materials). To detect 
release of stalled ribosomes, we used the anti- 
biotic puromycin. Puromycin binds to the ribo- 
somal A site and is incorporated into the nascent 
polypeptide, leading to its release from the 
ribosome (13). SecM-arrested ribosomes, contain- 
ing a prolyl-tRNA’™” stably bound in the A site, 
are refractory to treatment with puromycin, but 
become sensitive after arrest release, proline in- 
corporation, and translocation (/4) (figs. S3 and 
S4). In the presence of puromycin and EF-G, arrest 
release will become apparent as a rupture of the 
tether (Fig. 1B and fig. S4). 

We applied a defined, constant force to the 
molecule in the range of 10 to 30 pN and mea- 
sured the time required to restart translation, as 
measured by tether rupture. The mean restart 
times decreased with increasing force (Fig. 
1C). We calculated the rate of stalling rescue 
as a function of the applied force (Fig. 1, C and 


D, and figs. S5 and S6). By fitting the force- 
dependent rates to Bell’s model (15), we esti- 
mated a distance to the transition state (Aa’*) of 
0.4 nm [95% confidence interval (CI): 0.1 nm, 
0.8 nm] and a zero-force rupture rate (Kp) of 3 x 
10~* s* (95% CI: 0.5x10~* s4, 20 x 10°* s”). 
This rate is in agreement with biochemical en- 
semble experiments, in which no force was 
applied (Fig. 1D, blue dot and fig. S3). In the 
force range of our experiments, release of SecM- 
mediated arrest is accelerated by more than an 
order of magnitude (Fig. 1D), supporting the 
hypothesis that SecM arrest is relieved by the 
mechanical force generated by the SecA adeno- 
sine triphosphatase (ATPase). 

Cotranslational insertion of transmembrane 
helices via the translocon can release SecM- 
mediated stalling, presumably by generating 
force (16). We wondered whether folding of a 
nascent globular protein domain could gener- 
ate a force capable of modulating elongation 
by acting on peptide-tunnel interactions. Such 
interactions could serve to tune elongation rates 
to folding transitions (3, 4). The exit tunnel is 
too narrow to accommodate folded protein do- 
mains (17); therefore, as a nascent polypeptide 
emerges from the exit tunnel and folds in close 
proximity to the ribosome, it will be sterically 
excluded from the tunnel. This steric exclusion 
might generate a force that pulls on the nascent 
chain within the exit tunnel, which could mod- 
ulate ribosome activity. 

Having established that force accelerates SecM 
arrest release, we used SecM as a sensor to detect 
if nascent protein folding outside the ribosome 
can release the arrest. We constructed a library 
of plasmids encoding fusion proteins in which 
the stalling sequence (SecM17) is separated from 
the de novo-designed protein Top7 (8) by flex- 
ible linker sequences of various lengths, fol- 
lowed by a reporter green fluorescent protein 
(GFP) (Fig. 2A, fig. S7, and supplementary 
materials). Top 7 folds rapidly against an ap- 
plied force in close proximity to the ribosome 
(fig. S8). The GFP coding sequence is translated 
only upon successful release of the SecM17- 
mediated translation arrest. Variations in the 
length of the linker separating Top7 and SecM17 
would affect the translation outcome of these 
constructs (Fig. 2B). Short linker sequences will 
not allow folding of Top7 because the C termi- 
nus of the protein will be sequestered in the exit 
tunnel (Fig. 2B, top). Intermediate-length link- 
ers will allow more of the Top7 sequence to 
emerge from the ribosome tunnel and for the 
protein to fold and produce the steric exclusion 
folding force (Fig. 2B, middle). And while longer 


24 APRIL 2015 * VOL 348 ISSUE 6233 457 


RESEARCH | REPORTS 


linker sequences would also allow Top7 to fold, 
increased separation between the folding do- 
main and the ribosome should abolish the pro- 
posed force-generating steric exclusion release 
mechanism (Fig. 2B, bottom). 

We transformed Escherichia coli with the 
plasmid library containing linker lengths vary- 
ing from 4 to 28 amino acids. When grown under 
inducing conditions, a fraction of the colonies 
exhibited green fluorescence, indicating accu- 
mulation of GFP (Fig. 2C) and suggesting that 
SecM17-mediated stalling had been rescued in 
some of the transformants. We isolated and se- 
quenced plasmid DNA from 63 fluorescent 
colonies. Plasmids isolated from fluorescent 
colonies contained linker sequences between 15 


and 22 amino acids in length (Fig. 2D and fig. 
S9). Given that the SecM17 sequence contributes 
16 amino acids to the polypeptide and the 
ribosome tunnel can accommodate 30 to 35 
residues (7), a linker length of 15 to 22 amino 
acids corresponds to having the protein sequence 
barely outside the tunnel exit. These results sug- 
gest that nascent chain folding near the ribo- 
some tunnel exit can result in release of SecM 
arrest by stretching the polypeptide in the tunnel. 

When Top 7 folds near the tunnel exit, it does 
so against the steric exclusion force that it 
generates in the process. The protein must be 
able to fold against this force and remain folded 
for a sufficiently long period of time to release 
stalling by SecM. To estimate the forces gen- 
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erated by the protein, we performed optical 
tweezers force spectroscopy measurements with 
single Top7 molecules tethered by their termini 
(fig. S10). We measured the distributions of 
lifetimes of both the unfolded and folded states 
(Fig. 3, A to C). From these distributions, we 
extracted the force-dependent rates of folding 
and unfolding events (Fig. 3D and supplemen- 
tary materials) (19). Folding rates decrease with 
increasing force applied to the protein, and 
unfolding rates increase. The intersection of the 
folding and unfolding distributions occurs at 
~12 pN and represents the force at which the 
protein has equal probability of being folded or 
unfolded, a mean lifetime of 28 s for both states. 
Thus, as it emerges on the surface of the ribosome 
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Fig. 1. A direct applied force catalyzes release of SecM-mediated arrest. (A) Ex- 
perimental setup for optical tweezers experiments. When the nascent chain is transferred 


to puromycin, the assembly breaks. The structure of CaM was obtained from Protein 
Data Bank (PDB) ID 1CLL. (B) Example trace for restart experiment. After the “hopping” 
signature of CaM is observed (inset) at 7 pN, the force is raised to 20 pN. Red arrow: The tether breaks after ~3 min at 20 pN. (C) Restart lifetimes at 
each force. Red lines: Distributions returned by the right-censoring MLE. (D) Force-dependent rates for restart of SecM-stalled RNCs in the optical 
tweezers. Rates are determined as shown in (C), with error bars representing 95% Cls returned by the MLE. Red dotted line: Fit of Bell’s model to 
optical tweezers data. Ax*: 0.4 nm (95% Cl: 0.1 nm, 0.8 nm) and ko: 3.3 x 10% s+ (95% Cl: 0.5 x 10% s7}, 20 x 10°* s”). Black points: Rates 
determined with a method to account for nonspecific tether rupture (fig. S6). Error determined by bootstrapping. Blue dot: Lifetime obtained from 
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Fig. 2. Nascent protein folding near the ribosome tunnel exit can rescue SecM-mediated stal- 
ling. (A) Primary sequence of the construct used in the GFP reporter assay. (B) Schematic illustrating the 
translation outcome for a short (top), intermediate (middle), and long (bottom) linker. (C) Ultraviolet- 
illuminated image of colonies transformed with the linker library and grown under inducing conditions. (D) 
Histogram of linker lengths recovered by sequencing of fluorescent colonies. Gray shaded area: library range. 
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and folds, Top7 can exert a force of at least 12 pN 
for many seconds on the nascent chain still in 
the tunnel, before it unfolds. On the basis of our 
single-molecule results, we propose a kinetic 
scheme that describes how folding can modulate 
arrest release rates (Fig. 4A): 


Ky) kp) 
R<-UeoN-R 


ko ku) 


U and N are the unfolded and natively folded 
states of Top7; F is the force, and R the stall- 
released ribosome state, which can be accessed 
from the folded (N) state [at a rate accelerated 
by the exclusion steric force, kg(F)] or from the 
unfolded (U) state (at the basal, force-independent 
rate, Ko). ky(F) and ky(F) represent the force- 
dependent folding and unfolding rates of Top7 ob- 
tained from single-molecule experiments (fig. S11). 
The effectiveness with which the pulling force 
catalyzes stall release depends on the force gen- 
erated upon folding, the probability that the pro- 
tein folds at that force, and the lifetime of the 
folded state. To determine how translation stall 
release rates depend on these factors, we solved 
the kinetic scheme above for the effective stall 
release rate, kz ,F) (supplementary materials). An 
approximate solution that assumes equilibrium 
between N and U (good for ky >> kp) yields (see 
supplementary materials for the exact solution): 


) 


where fo is the fraction of natively folded pro- 
tein assuming equilibrium with the unfolded 


kyko 
ky 


Ieneg(F) = fo( Hon + 


Fig. 3. Top7 refolds against an applied mechanical 
load. (A) Example force ramp cycles for a single Top7 
molecule. Pulling is shown in red, relaxing in blue. 
Successive cycles are offset along the x axis for display 
purposes. (B and C) Folding and unfolding force 
distributions, respectively, for Top7 at a pulling speed of 
100 nm/s. Black line: Distributions reconstructed from 
the force-dependent rates in (D). The unfolding-force 
distribution in (C) is right-censored because the maxi- 
mum force in pulling experiments was set at 45 pN to 
avoid tether rupture. (D) Force-dependent rates of 
folding and unfolding extracted from the distributions in 
(B and C). Dashed lines: fit of Bell’s model to the force- 
dependent rates. For folding, Ax*: 6 nm (95% Cl: 4 nm, 
8 nm) and ko: 1 x 10° s* (95% Cl: 0.04 x 10° s+, 30 x 
10° s*). For unfolding, Ax*: 0.4 nm (95% Cl: 0.3 nm, 
0.6 nm) and kg: 0.01 s™? (95% Cl: 0.003 s7}, 0.03 s74) 
(supplementary materials). 


24 APRIL 2015 * VOL 348 ISSUE 6233 459 


RESEARCH | REPORTS 


—_ released 


elongating 
completed/ 
terminated 
Boz 
0.6 
0.5 
£ 
= 0.4 
oO 
2 
2 0.3 
a 
0.2 
0.1 
% 5 10 15 20 


Force generated by the folding protein (pN) 


Fig. 4. Kinetic model for folding-induced re- 
lease of stalled ribosomes. (A) Kinetic scheme 
illustrating the pathway to release of translation ar- 
rest. The nascent polypeptide can transit reversibly 
between the native and unfolded states, with rate 
constants kn(F) and ky(F). Once folded, the nascent 
protein both generates and experiences a force, “F,” 
which can drive it either irreversibly to the “released” 
state, with rate constant ke(F), or back to the 
unfolded state. In addition, the stall can be re- 
leased via the irreversible spontaneous process 
from the unfolded state, with rate constant ko, 
which is independent of force. PDB: 1QYS and 
1EMA. (B) The probability of force-catalyzed stall 
release is plotted as a function of the folding force. 


state. fo decreases with force as the equilibrium 
is tilted toward the unfolded state; in contrast, 
the term in parentheses increases with force as 
the release rate increases and the protein is 
biased toward the unfolded state (fig. S12). 
Owing mainly to the difference in experimental 
geometry between the dual-tethered Top7 in the 
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optical tweezers and the singly tethered Top7 in 
the in vivo experiments, the force-dependent 
rates likely represent an underestimate to the 
true rates (supplementary materials). Although 
the experimentally observed quantity, kz_.(F), 
represents the composite kinetics of both the 
spontaneous and force-dependent release rates, 
we calculated the probability that release pro- 
ceeds by the mechanical, as opposed to the 
spontaneous, process. This function provides 
the most likely force at which folding of the 
nascent protein leads to the release of the ribo- 
some stall and can be expressed as: 


eRe: (F) — Ko 
= Ear FT 


A plot of P(F) versus force shows that the 
probability of force-catalyzed stall release is a 
maximum near 10 pN (Fig. 4B). At forces below 
12 pN, Top7 is mostly folded (N), so release 
proceeds largely through the mechanical path, 
whose rate, Xp, increases with increasing force; 
at forces higher than 12 pN, the protein spends 
shorter and shorter times in the folded state, so 
release occurs more and more via the slower, 
spontaneous path, Xo, from the unfolded state. 

Our results provide evidence that the trans- 
locon must generate at least 10 pN of force to 
relieve SecM-induced ribosomal arrest. It appears 
that another translocase, ClpX, operates in a 
similar force regime (20, 27). Given that a num- 
ber of polypeptide sequences are known to stall 
the ribosome (3, 22-24), our results suggest 
that force can play a generally important role as 
a regulator of elongation. In instances where 
ribosome-nascent chain interactions are less ro- 
bust than the SecM system, folding could play 
an important role in modulating translation 
elongation and vice versa. For example, if such 
interactions occur near the C terminus of a new- 
ly synthesized domain, elongation would slow 
down when the polypeptide segment just out- 
side the ribosome begins to acquire stable struc- 
ture, permitting the folding of this segment to 
be completed before more of the polypeptide is 
synthesized. Likewise, folding of the domain, and 
the force so generated, may provide the signal 
for speeding up elongation through and beyond 
the regulatory signal. A force could be generated 
not only by nascent chain folding, but also by the 
binding of partners to the nascent chain outside 
the ribosome. In eukaryotic cells, ribosome pro- 
filing experiments have shown that the chaper- 
one Hsp70 relieves global stalling of ribosomes 
near the beginning of genes (25, 26), perhaps by 
generating a pulling force similar to the scenario 
of Hsp70 binding to protein aggregates (27). 
Thus, force generated by either nascent chain 


folding or chaperone binding could constitute 
an important feedback mechanism to tune elon- 
gation to folding. 
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women researchers. 


The closing date for applications is 31 July 2015. More information: www.humboldt-foundation.de/SKP_en 


Exzellenz verbindet - 
be part of a worldwide network. 


Alexander von Humboldt Foundation 
Jean-Paul-Str. 12 

53173 Bonn 

Germany 


info@avh.de 


www.humboldt-foundation.de 


IMMUNE CHECKPOINTS 


ig IHC protocols 
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CANCER IMMUNOLOGY 


Proven specificity & sensitivity. Results you can count on. 


Antibodies for PD-L1, B7-H3, B7-H4, Phospho-SLP-76 (Ser376), Phospho-Stat3 (Tyr705), and more from CST. 


24 Phospho-SLP-76 (Ser376) 
3 4 (D9D6E) Rabbit mAb #14745: 
wm Flow cytometric analysis of 
7 Jurkat cells, untreated (blue) 
1 or treated with H202 (11 mM, 
PD-L1 (E1L3N®) XP® < J 1 min; green), using #14745. 
Rabbit mAb #13684: | ¢ Set Poe 4 Anti-rabbit IgG (H+L), F(ab’), 
IHC analysis of paraffin- [3s a ‘et 8 Ney ea ee 4 Fragment (Alexa Fluor® 488 
embedded human lung a) yee ta Vi 22. Pye) & 7 ? : Conjugate) #4412 was used as 
carcinoma using #13684. “PISewa Pus ® “Os Aus ete A 48 Phospho-SLP-76 (Ser376) a secondary antibody. 


www.cellsignal.com/cancerscience 


\ Visit our website to request our Tumor Immunology Poster and for additional validation and competitor comparison data. 


© 2015 Cell Signaling Technology, Inc. Cell Signaling Technology, CST, E1L3N and XP are trademarks of Cell Signaling Technology, Inc. 


For Research Use Only. Not For Use In Diagnostic Procedures. yl Cell Signaling 
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POSITIONS OPEN 
POSTDOCTORAL FELLOWSHIP 


Postdoctoral fellowship is available to pursue re- 
search supported by NIH grants. Studies will address 
the effects of plant products such as resveratrol, indoles, 
and cannabinoids on inflammation, autoimmunity, 
and cancer. Other projects include studies on the role 
of CD44, estrogens, and dioxins on immune response. 
Ph.D. in any area of Biology is required. Experience in 
epigenomics, genomics, and/or microbiome research is 
required. Send curriculum vitae and three references to: 
Dr. Mitzi Nagarkatti, Carolina Distinguished Profes- 
There's anly one Spice sor and Chair, Department of Pathology, Microbiol- 

: ogy and Immunology, University of South Carolina 
School of Medicine, Columbia, SC 29229 or e-mail: 
postdoccamuscmed.sc.edu. USC Columbia is an Equal 
Opportunity /Affirmative Action Employer and encourages appli- 
cations from women and minorities. 
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Trondheim is the ancient Viking capital of Norway. The Nidelva River flows through the city, and you can even fish for salmon during 
your lunch break. You'll find hiking, alpine and cross-country skiing, cycling and more within a 10-minute drive of the city centre. 


Photo: Carl-Erik Eriksson 
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FUNDING OPPORTUNITIES — U.S. Department of Defense 


Defense Medical Research and Development Program 


Peer Reviewed Medical Research Program 


The Peer Reviewed Medical Research Program (PRMRP) funds exceptional research with the goal to improve the health and well-being of all military 
Service Members, Veterans, and their beneficiaries. The PRMRP received $247.5 million in fiscal year 2015 (FY15) and seeks grant applications in 
the following topic areas: 
Acupuncture DNA vaccine technology for Integrative medicine Post-traumatic osteoarthritis 
Acute lung injury post-exposure prophylaxis Interstitial cystitis Psychotropic medications 
Advanced prosthetics Dystonia Lupus Pulmonary fibrosis 
Arthritis Focal segmental glomerulosclerosis Malaria Respiratory health 
Burn pit exposure Food allergies Metals toxicology Rheumatoid arthritis 
Cardiovascular health Fragile X syndrome Mitochondrial disease Scleroderma 


Chronic migraine and Healthcare-acquired infection reduction — Nanomaterials for bone regeneration Sleep disorders 
post-traumatic headache Hepatitis B Osteoarthritis Tinnitus 


Congenital heart disease Hereditary angioedema Pancreatitis Vascular malformations 
Dengue Hydrocephalus Pathogen-inactivated dried plasma Women’s heart disease 
Diabetes Inflammatory bowel disease Polycystic kidney disease 


Descriptions of the FY15 PRMRP Program Announcements and * Clinical Trial Award ° Investigator-Initiated Research Award 
General Application Instructions are anticipated to be posted on * Discovery Award * Technology/Therapeutic Development Award 
Grants.gov by mid-April 2015: * Focused Program Award 


All applications must conform to the Program Announcements and General Application Instructions that will be available for electronic downloading from the Grants.gov website 
(all viewable under CFDA number 12.420). Execution management support will be provided by the Congressionally Directed Medical Research Programs. 


http://cdmrp.army.mil 
http://cdmrp.army.mil/funding/prmrp.shtml 


Now Recruiting: 


Biofuels Scientist, Houston, Texas 


Do you have a PhD in microbiology, biochemistry o1 


Janelia Research Campus environmental microbi 
is looking for enterprising See te oe 2 
early career scientists 
to do hands on, 
independent research. 


y with a strong desire to 


2 Then we'd like you to help 


us pilot an exciting new project. 


You'll need experie of the following 


areas: soil carbon estration, 


metabolic pathway improvement via transformation, 


enzyme hydrolysis, solid or liquid fermentation. 


Find out more and apply online at www.shell.com/ 


Apply by May 1 5, 2015. careers. In the "Students and Graduates” section, click 
Visit Janelia org/fellows to learn more on “Shell Recruitment Day” and then create an account. 
; : When entering your contact details, select “Other” in 


sponse to "Where you found out about this Shel 


opportunity,” click “Next” and then type in 
“Biodomain Researcher.” 


a _— a 
hhmi | janelia 
Research Campus | Fi in| 


Minority, Female, Disabled and Veteran EEO/AA Employer. 
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THE ONSAGER FELLOWSHIPS 


12 tenure-track positions available at NINU 


The Norwegian University of Science and Technology (NTNU) is Norway's primary 
institution for educating the future’s engineers and scientists. The university also 
has strong programmes in the social sciences, teacher education, the arts and 
humanities, medicine, architecture and fine art. NTNU’s cross-disciplinary 
research delivers creative innovations that have far-reaching social and economic 
impact and that help contribute to a better world. 


The Onsager Fellowship programme at NTNU is designed to attract the most 
talented scholars with an established reputation for high quality research and a 


commitment to learning and teaching at the university level. 


APPLY FOR A TENURE-TRACK POSITION AS AN ASSOCIATE PROFESSOR IN: 

e Linguistics e Safety and reliability of complex systems 

© Robotic vision e Marine structures for the future - marine technology 
e Molecular biodiversity e Zero emission refurbishment of the built environment 
e Medicine — bioinformatics ¢ Economics of natural resources and quantitative 

e Medicine - molecular biology preter 

e Statistical machine learning 
* Theoretical condensed matter physics More info at: www.ntnu.edu/onsagerfellowship 
e Inorganic or hybrid functional materials Closing date: 25 May. 


NTNU - Trondheim 


Norwegian University of TOP RESEARCH DEMANDS BRILLIANT MINDS 
Science and Technology ~ WE'RE ALWAYS LOOKING FOR THE BEST 


online @sciencecareers.org 
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Department of Health and Human Services 
National Institute of National Institutes of Health 
NIH) Goreral Merical Sciences National Institute of General Medical Sciences 
Division of Genetics and Developmental Biology 


DIVISION DIRECTOR 


The National Institute of General Medical Sciences (NIGMS), a major research component of the National Institutes of Health (NIH) and the Department of Health 
and Human Services (DHHS), is seeking exceptional candidates for the position of Director, Division of Genetics and Developmental Biology (GDB). Information 
about GDB is available at http://www.nigms.nih.gov/A bout/Overview/pages/gdb.aspx. 


GDB has supported many of the exciting fundamental discoveries that have led to deeper knowledge of how cells and organisms function as well as to new 
technologies and approaches. In FY 2015, the Division of Genetics and Developmental Biology had an annual budget of about $439 million and is one of five 
scientific divisions within the NIGMS. GDB is organized into two branches, one focused on genetic mechanisms and one on developmental and cellular 
processes, and has 11 scientific staff members who serve as program officers. The division director reports to the NIGMS director and is a member of the NIGMS 
senior leadership team, which helps set policies and priorities for the Institute. There are also opportunities to participate in and advise on NIH-wide activities and 
collaborations with other federal agencies. 


While concentrating on general principles of genetics, gene expression and developmental biology, often using model organisms, research supported by GDB 
underpins studies on human health and disease. The Director, GDB plans and directs a program of research grants and contracts, analyzes national research efforts 
on the problem of human genetic disease, makes recommendations to assist the National Advisory General Medical Sciences Council or other advisory 
committees or groups, identifies the need for research in the area of genetics and developmental biology, and advises universities, other centers of medical research, 
and professional and lay organizations about research needs and requirements. The Division supports research on genetic and molecular regulation, protein 
synthesis, gene expression, DNA and RNA, intracellular signaling pathway dynamics and network organization in genetics, among other areas. This position offers 
important opportunities to set scientific priorities, lead change and improve the research enterprise. 


Qualifications: Candidates must possess an M.D., Ph.D., or equivalent degree in a field relevant to the position. The ideal candidate will have considerable research 
experience and will possess a broad spectrum of scientific knowledge related to the NIGMS mission. In addition, candidates should possess recognized research 
management and leadership abilities. A strong understanding of the fundamental mechanisms of inheritance, development, and cell function is desired. The 
position will be filled under a Title 42(f) excepted service appointment. 


Salary/Benefits: Salary is competitive and will be commensurate with the experience of the candidate. A recruitment or relocation bonus may be available, and 
relocation expenses may be paid. A full package of Federal Civil Service benefits is available, including: retirement, health and life insurance, long term care insur- 
ance, leave, and a Thrift Savings Plan (401K equivalent). The successful candidate is subject to a background investigation and financial disclosure requirements. 
How to Apply: Applicants must submit a current curriculum vitae, bibliography, copy of degree, and full contact details for three references. In addition, 
applicants are asked to prepare two statements: a vision statement and a statement that addresses the specific qualification requirements (please limit both 
statements to two pages each). NIGMS will be accepting applications from April 13, 2015, and plans to have the position open for at least 45 days, but will not 
close the application process until a candidate has been selected. Please send your application package to GDBDirectorApplicants@nigms.nih.gov. You may 
contact Shea Keys with questions about this vacancy at skeys@mail.nih.gov or 301-594-0945. 


HHS and NIH are Equal Opportunity Employers 


USDA ILLI NOIS “YOU ARE WATCHING THIS BEAUTIFUL 
———— [ ECOSYSTEM BE DEGRADED BY CLIMATE 
aa UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN CHANGE OR HUMAN INTERACTION... 


United States Department of Agriculture THEN YOU SORT OF PULL UP YO 


Office of the Chief Scientist DIRECTOR OF SOCKS AND GO SEE WHAT YI 
The OFFICE OF THE CHIEF SCIENTIST in the 
DEPARTMENT OF AGRICULTURE is seeking RESEARCH FACILITIES 


to fill the position of Departmental Scientific . 
: Materials Research Laborato - 
Integr ity Officer. ry Marine conservationist and 


USDA’s Scientific Integrity Officer serves College of Engineering ny py Kenyan oo eee A 
as the principal science policy advisor to the 

Department’s Chief Scientist and Under Secretary 
for Research, Education, and Economics (REE) 
and senior USDA leadership on science policy 
and scientific and research integrity. This position 
is located within USDA’s Office of the Chief 
Scientist (OCS). The incumbent develops USDA- 
wide science policy and provides policy direction 
and guidance that supports the integrity and 
bioethics of research that benefits U.S. agriculture 
and consumers. This position also leads and 
provides direction to USDA’s Agency Scientific 
and Research Integrity Officers. The position is 
located in Washington, DC. Recruitment is at 
the GS-15 grade level; salary range $126,245 to 
$158,700 per year. More information on OCS 
can be found at www.usda.gov/wps/portal/usda/ http:/mrLillinois.edu/facilities.html. 


usdahome?navid=OCS. a j epee a F 
: Please visit http://jobs.illinois.edu to view 

Acopy of job announcement (AG-01-2015-0126) the complete position announcement and 
and application procedures are available at: WWW. application instructions. The closing date Every scientist 
usajobs.gov. Applications must be received by for this position is May 30, 2015. lige rs 
prior to 11:59 P.M., (ET), Tuesday, May 12, 2015. 54 Stor y 

ce irae eee www.inclusiveillinois.illinois.edu Read history ak WN S 

EQUAL OPPROTUNITY PROVIDER AND ead Ms story'a 


membercentral.aaas.org MEMBERCENTRAL 
EMPLOYER. Illinois is an EEO Employer/Vet/Disabled 
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University of Illinois at Urbana-Champaign 


The Frederick Seitz Materials Research 
Laboratory (MRL) at the University 

of Illinois at Urbana-Champaign, is 
seeking a dynamic individual to serve as 
Director of Research Facilities to lead the 
combined MRL Central Facilities. Our 
world-class facilities contain more than 

40 major research instruments, which are 
maintained by 20 dedicated staff scientists 
and engineers to annually provide 1,000 
researchers with over 100,000 user hours. 
Additional information about the 

MRL Central Facilities can be found at 


Discover. a 


Care 


The NIH is the center of medical and behavioral research for the Nation ----making essential medical discoveries that improve health and save lives. 


Are you an exceptional candidate with proven expertise in biomedical informatics, research analysis of clinical data, and EHR systems who can provide leadership 
to the high-priority, trans-NIH initiatives of biomedical discovery in an era of big data? This position offers a unique and exciting opportunity to serve as a Senior 
Advisor for Clinical Data Science and join a team of data science specialist with significant expertise in basic research data working on: developing an enterprise-level 
solution to support a 1 million plus patient cohort for precision medicine; developing the “Commons” — a conceptual framework to support biomedical basic and 
clinical data in the cloud and beyond; developing and maintaining clinical data standards, policies, and sustainability; training; innovating data science; improving 
internal processes for data centric grant review; and communicating and subsequently sharing actions with all stakeholders, notably other federal agencies, philanthropic 
organizations, the private sector, and international partners. Given the high visibility and priority of the PMI, the Senior Advisor will interact directly with both the 
ADDS and the NIH Director. 
Duties include: working with the PMI team to establish methods and standards for acquiring and transmitting clinical data derived from EHRs for PMI cohort 
research purposes; advising on issues related to the use of mobile devices and mHealth apps for acquisition and management of research data and participant reported 
observations; working with ADDS team to establish a synergistic exchange, such that best practices in handling unstructured research data can be translated into 
analogous best practices for research uses of clinical data; having a trans-NIH role in communicating and fostering developments surrounding clinical data, including 
the development and adoption of common clinical data elements across NIH, and developing standards on behalf of the NIH as a whole, with a particular need to 
interface with the efforts of the National Library of Medicine (NLM); helping to inform policies and regulatory affairs surrounding research uses of clinical data with 
particular emphasis on the balance of clinical data availability versus the need to protect patient privacy; advising on the development and maintenance of clinical data 
resources at NIH such as dbGaP, ClinVar, ClinGen and ClinicalTrials.gov; supporting the ADDS team to foster of new innovations surrounding big data, notably the 
establishment of the Commons; contributing to the design of intramural and extramural training programs that emphasize clinical data; contributing to the design of 
extramural funding programs as part of the BD2K initiative; and working closely with the Chief of the Laboratory for Informatics Development to synergize clinical 
informatics developments across the NIH. 
The successful candidate will have a doctoral-level degree in biomedical informatics, significant research experience in handling and analyzing clinical data, 
proven expertise working with and developing EHR systems, and strong interpersonal skills collaborating as part of a team. Appointment will be at a salary 
commensurate with experience and accomplishment and includes full Federal benefits, including leave, health and life insurance, retirement and savings 
plan (401K equivalent). 

If you are ready for an exciting leadership opportunity, please see the detailed vacancy announcement at: http://www.jobs.nih.gov (under Executive 

Careers). Application requirements apply and are due by May 11, 2015. 


NIH AND DHHS ARE EQUAL OPPORTUNITY EMPLOYERS. 


FALL CONFERENCES 


Spend three days 
engaged in dialogue 
with leading 
scientists at our 
campus near 
Washington, DC. 


hhmi janelia 


Research Campus 


Janelia conferences 

are small, highly 
specialized events 
designed to foster 
collaborative interactions 
and the exchange of 
exciting new ideas. 


Behavioral Epigenetics: 
Conserved Mechanisms in 
Diverse Model Systems 
Organizers: Ulrike Heberlein, 
Michael Meaney, Eric Nestler, 
David Sweatt 


Hypothalamic Circuits for 
Control of Survival Behaviors 
Organizers: Sydney Brenner, 
Amita Sehgal, Scott Sternson, 
Joseph Takahashi 


Janelia.org/conferences15 


May 15, 2015, 2:00 p.m. ET 


Challenges in 
Crystallography 
Organizers: Tamir Gonen, 
Ana Gonzalez, Nicholas Sauter 


Emerging Tools 

for Acquisition and 
Interpretation 

of Whole-Brain 
Functional Data 
Organizers: Philipp Keller, 
Alipasha Vaziri 


¢ No registration fee 

e Travel scholarships 
available 

e Convenient three-day 
format 

¢ Onsite accommodations at 
Janelia’s innovative campus 


Hippocampal-Entorhinal 
Complexities: Maps, Cell 
Types and Mechanisms 
Organizers: Lisa Giocomo, 
Nelson Spruston, Albert Lee, 
Jeff Magee 


Mammalian Circuits 
Underlying Touch Sensation 
Organizers: David Ginty, 

Ellen Lumpkin 


online @sciencecareers.org 
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CHAIR POSITION 
The University of Missouri 
Department of Biomedical Sciences 
College of Veterinary Medicine 


The University of Missouri College of Veterinary Medicine is seeking 
nominations/applications from outstanding academic scientists for Chair of the 
Department of Biomedical Sciences. Candidates must hold a PhD, DVM, MD 
or equivalent degree and academic credentials necessary to justify a tenured 
appointment at the rank of Professor, a robust record of past and current 
extramural research funding, as well as outstanding leadership, organizational, 
communication, and personnel management skills. 


The successful candidate will continue shared governance with faculty to 
direct the research, teaching and service activities of the department. The Chair 
is the chief academic officer of the Department and is expected to enhance 
and promote excellence in research by advancing and strengthening research 
through extramural support. Chief academic duties also include continuing the 
outstanding teaching in the professional curriculum and in graduate education. 
The appointee is expected to continue a personal program of scholarly activity 
and to stimulate faculty toward high academic achievement. The position is 
supported by an excellent start-up package, which includes appointment to 
the Alice Johnston Foster Distinguished Professorship in Veterinary Medicine, 
competitive compensation and benefits, attractive laboratory space, and access to 
state-of-the-art core facilities. Please see the listed websites for more information 
about the multidisciplinary Department (http://biomed.missouri.edu) and the 
University of Missouri and city of Columbia, MO (http://hrs.missouri.edu/ 
find-a-job/index.php). 


Applicants should submit a letter of application containing a statement of 
research interests and leadership vision, curriculum vitae, and the names and 
contact information of 3 references together as one PDF directed to the chair of 
the search committee. Applications may be submitted here: http://hrs.missouri. 
edu/find-a-job/academic/index.php (Job ID#15633). Review of applications 
will continue until the position is filled. 


The University of Missouri is an Equal Access, Equal Opportunity, 
Affirmative Action, Pro-disability and Veteran Employer. To request ADA 
accommodations, please call the Director of Accessibility and 
ADA Education at 573-882-9069. 
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fir UNIVERSITY of IOWA 
CARVER COLLEGE 
oF MEDICINE 


University of lowa Health Care 


Director, Neuroscience Institute 


The Carver College of Medicine at The University of Iowa seeks candidates for 
the position of Director, Neuroscience Institute. The University is establishing a 
Neuroscience Institute that will bring together researchers from the entire campus, 
and span the continuum from structural biology to translational medicine. This 
is an extraordinary opportunity, as substantial resources are available to recruit a 
director and multiple faculty members. In addition, the opening of the Pappajohn 
Biomedical Discovery Building provides new space to facilitate recruitment and 
thematic alignment of multiple laboratories. 


The Director is the spokesperson for the Institute and represents it in interactions 
with the college and the university, as well as nationally. Responsibilities include 
faculty recruitment; educational and research programs; fiscal planning and 
management; interaction with other departments and centers; and short term and 
long term strategic planning. The Director of the Neuroscience Institute reports 
to the Vice Dean for Research and is a member of the Medical Council, which 
provides input to the Dean. 


Candidates must have a previous record consistent with appointment as a tenured 
professor in an appropriate academic department. This includes a PhD or MD 
degree or equivalent and an outstanding record of accomplishments in research, 
teaching, and service. The successful candidate should have the demonstrated 
capacity to foster an environment in which excellence in research, teaching, 
and scholarship can flourish, as well as experience and interest in developing 
and leading research programs. Preference will be given to candidates who 
have a deep knowledge of fundamental neurobiology. Other desired attributes 
include a record of innovative and effective administrative and fiscal leadership 
and excellent interpersonal skills. A commitment to interdisciplinary research 
programs is necessary. To apply for this position visit our website at http:// 
jobs.uiowa.edu/faculty. (Requisition #66283) 


The University of Iowa is an Equal Opportunity Affirmative Action Employer. 
All qualified applicants are encouraged to apply and will receive consideration 
for employment free from discrimination on the basis of race, creed, color, 
national origin, age, sex, pregnancy, sexual orientation, gender identity, 
genetic information, religion, associational preference, status as a qualified 
individual with a disability, or status as a protected veteran. 


Herman Ostrow School of Dentistry of USC 


The Herman Ostrow School of Dentistry of USC seeks outstanding 
candidates for a tenure-track or tenured position at the rank of associate or 
full professor in the Division of Biomedical Sciences to conduct cutting-edge 
research in the areas of cell and developmental biology, tissue regeneration, 
cell signaling and/or gene regulation at the Center for Craniofacial 
Molecular Biology (CCMB). CCMB is well-known for being the leader in 
craniofacial developmental biology and tissue regeneration research. 
Examples of appropriate research interests include molecular mechanisms 
of organogenesis, stem cell biology, genetics of craniofacial malformations, 
and orofacial disease-related research. 


Candidates must have a PhD in developmental biology, stem cell biology, 
or molecular biology, and should have a DDS/DMD or equivalent dental 
degree. Candidates must have demonstrated a strong track record of NIH 
funding and high quality research with significant impact in the field of 
craniofacial developmental and stem cell biology. Exceptional candidates 
without a dental degree will be considered. Candidates with translational 
research interest are strongly encouraged to apply. The successful candidate 
must conduct independent and extramurally funded research programs 
that complement our interests as described at http://ccmb.usc.edu and 
http://pibbs.usc.edu. 


Interested applicants should submit a cover letter, complete curriculum 
vitae, statement of current and future research plans, selected recent 
publications, and arrange to have three letters of reference sent to: 


Dr. Yang Chai, Search Committee Chair 
USC Center for Craniofacial Molecular Biology 
Herman Ostrow School of Dentistry of USC 
c/o Ms. Patricia Thompson 
2250 Alcazar Street, CSA 103 
Los Angeles, CA 90033 
pathomps@usc.edu 


For more information and/or to apply: https://jobs.usc.edu/postings/32647 
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Special Job Focus: 


Microbiology 


May 15, 2015 
Reserve space by April 28* 


THERE’S A SCHENCE TO REACHING SCIENTISTS. 


For recruitment in science, there’s only one | Science 


Why choose this microbiology section for 
your advertisement? 


= Relevant ads lead off the career section with special 
Microbiology banner 


= Bonus distribution to: 


American Society for Microbiology (ASM) 
May 30-June 2, 2015 New Orleans, LA. 


Science 


* Ads accepted until May 110na 
first-come, first-served basis. 
SCIENCECAREERS.ORG 


To book your ad: advertise@sciencecareers.org 
The Americas Japan 
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e 4 b 44. x x cm 2015°2017 High-level Talent Introduction 


Announcement of Northeast Forestry University 


Established in 1952 and located in Harbin — beautiful “Ice City”, Northeast Forestry University is in national “21| Project” directly under the Ministry of Education of the 
People’s Republic of China and key construction projects of “Advantage Discipline Innovation Platform”. It is a multidisciplinary university integrating agriculture, science, 
industry, economics, management, culture, law, medicine and art with forestry science as advantage and forestry engineering as specialty, 
To further improve the overall level of teaching staff and achieve a research university with intemational view, we are hereby looking for talents sincerely and mainly introducing 
excellent domestic and foreign talents with important complementary and supporting functions for the construction of subjects and talent team. 
1. Subject Areas for Recruitment: 
Agriculture, General Computing, Computer Science, Information Science and 
Technology, Architecture, Planning, Cell & Molecular Biology, Bioinformatics, 
Ecology, Genetics, Microbiology & Virology, Plant Science, Zoology & Animal 
Science, Civil Engineering & Construction, Electrical & Electronics Engineering, 
Materials Science, Mechanical Engineering, Polymer Science & Technology. 
Veterinary Medicine, Business, Economics, Finance, and Accounting, Analytical 
Chemistry, Chemical Engineering, Inorganic Chemistry, Organic Chemistry, 
Physical Chemistry. 
| WeRecruitment Plan ; 
ing Talents or Teams: introduce leading talents or teams with international 
level and competitive capacity Ye oe” disciplines of 


Science Foundation of China and great development potential in academy and 
scientific research; age: under 35, 

5. Excellent Young Teachers: doctors with strong ability of teaching and scientific 
research; age: under 35. 

ill. Treatment 

1. Leading Talents: appointed to the post of professor; 

2, Distinguished Young Scholars: appointed to the post of professor, 

3. Excellent Young Scholars: appointed to the post of professor or associate 
professor, 

4, btiaie.2 Backbone; ss. to the post of associate eae or 


IV. Further Details and Contact Infi 
check the following links for 


High-level Talents Recruitment 


Program Fujian Normal University 
FUJIAN NORMAL UNIVERSITY 


About FNU 

A key institution of higher learning in Fujian with a century-old history and a glorious tradition, Fujian Normal University (FNU), co-sponsored by Fujian Provincial 
People’s Government and the Ministry of Education, is striving to turn itself into one of the Top 3 universities of the province. Located in Fuzhou, the capital city of 
Fujian, FNU has a student population of over 27,600, among whom about 6,600 are pursuing their graduate degrees. FNU boasts 37 first-level master’s programs and 
14 master’s programs conferring professional degrees, 19 first-level doctoral programs and 15 post-doctoral research centers. It is now well on its way to become a 
fully fledged comprehensive university. Upholding the strategy of “reinvigorating the university by talents”, FNU pledges to provide amenities and facilities to 
high-level talents who choose to work with us. 


Vacancies 

Education, psychology, applied economics, theoretical economics, law, Marxist theory, Chinese language and literature, foreign languages and literature (Spanish and 
Portuguese), sociology, journalism and communication, Chinese history, world history, public administration, business administration, sports, math, statistics 
(probability theory and mathematical statistics), computer science and technology, software engineering, physics, optical engineering, chemistry, material science, 


environmental science and engineering, geography, ecology, biology 


Benefits and Conditions 

House purchasing subsidies, settlement fees, housing allowance and research funds will be provided for successful applicants who will work at FNU as full-timers.For 
more information about the salary packaging, please refer to FNU’s Recruitment Program for High-level Talents in “Documents and Policies” at rsc.fjnu.edu.cn. 
For especially outstanding talents, a one-to-one policy will be adopted. High-level academic leaders are entitled to negotiable annual salary. Efforts will also be made 
to help build inter-college and interdisciplinary research teams. Research institutes, offices and labs will be set up and equipment and instruments will be purchased 
whenever it deems necessary. You can pursue your interests in research, HR management and the allocation of funds. 

You are also encouraged to apply for national and provincial grants for talents (such as the 1,000 Talent Plan of China, Fujian’s 100 Talent Plan and the 
Minjiang Scholars Program). For successful applicants, FNU will provide further financial support. 


Contact Us 

Website: www. fjnu.edu.cn 

Address: Office of Human Resources, Administration Bldg, Qishan Campus, Fujian Normal University 
No. | Keji Rd., Shangjie, Minhou, Univ Town Fuzhou Fujian 350117, P. R. China 

Job Vacancies: zp.fjnu.edu.cn 

Contacts: Mr. Le, Mr. Fu 

Tel: +86-0591-22867359 Email: rscrsk@fjnu.edu.cn 


online @sciencecareers.org 
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WORKING LIFE 


By Marion Ronit Munk and René Rickert 
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To work or not shouldn't be a question 


e are a two-scientist couple, an Austrian and a German, both with experience working in 

the United States. So we read with great interest the Working Life story in which Michelle 

Gabriele Sandrian, an American, shared her experience working as a postdoc in Austria 

while starting a family. (You can read it in Science at http://scim.ag/EuropePostdoc.) San- 

drian was happy to start a family in a place where she and her husband could take time off 

to care for their new baby. Sandrian and her family had a positive experience—werre glad 
about that—but that doesn’t mean Austria’s policies are good for women’s careers. 


In Austria—Germany is much the 
same—many people don’t have kids. 
Professional women tend not to have 
children, and women who do have 
children tend not to work or to work 
only part time. Men who have kids 
often have wives who stay at home. 

Austria ranks 68th in equality 
of economic participation and op- 
portunity in the World Economic 
Forum’s Global Gender Gap Report, 
below Russia, Kazakhstan, and 
Brunei. In the United States, which 
ranks fourth, 3.2% of corporate 
CEOs are women—hardly impres- 
sive, but that’s well above Europe's 
2.4%, as reported in 2013 by the 
European Commission. As of 2012, 
when the data for the commission’s 
report were collected, no major cor- 
poration in Austria had a woman as 
CEO. Not one. 

Why are women leaders so rare in a society with such gen- 
erous family-leave policies? Austria’s conservative culture 
is partly to blame, but the blame should be shared by the 
policies. For women, the Elternkarenz, which pays parents to 
stay home for up to 3 years in Germany and up to 2 years in 
Austria, is the best career killer one could imagine. And child 
care for young children is hard to find and usually covers 
only part of the day. My sister working in Germany may have 
to give up a career in surgery because her day care recently 
decided to close at 3:30 p.m. She has no alternatives. 

Long absences from work make women less competitive. 
They return to the office or the lab after a long leave to find 
that former colleagues have left them far behind. 

These policies affect all women, not just those who take 
long leaves. During the paid absence of a young parent, 
no employer may hire a permanent replacement. Because 
women are much more likely than men to take long paren- 
tal leaves, employers have a strong incentive to hire men. 
Europe’s equal-employment laws make such discrimination 


“Professional women tend 
not to have children.” 


illegal, but employers discriminate 
anyway, aS many young women in 
science and medicine can attest. 

How can the situation be im- 
proved? Women should continue 
to have sufficient time off, with 
pay, as they adjust to being moth- 
ers. But parental leaves need not 
last for years; Sandrian took just 
5 months off before going back to 
work. Policies should encourage or 
require fathers to do their share; 
Norway, one of the most success- 
ful countries in Europe in terms 
of women’s advancement, expects 
fathers to take off at least 10 weeks. 
More men taking more time off 
should reduce hiring bias and, as 
a bonus, encourage fathers to take 
on a greater share of the domestic 
burden, yielding benefits for wom- 
en’s careers for years to come. 

Most importantly, mothers need to be able to return to 
work sooner without having to worry about their babies, so 
high-quality subsidized, flexible, full-day child care needs 
to be more widely available for younger children. Part-time 
work arrangements should be available, too, but the laws 
should strongly encourage men to take equal advantage. 

If we wish to facilitate equal success for women in sci- 
ence, medicine, and other careers, changes in society are 
needed. Both women and men should demand policies 
that support mothers who choose to work and encourage 
their partners to take on their fair share of the domestic 
load. Time off is not the issue; we need to put policies in 
place that help women work. 


Marion R. Munk is an M.D-Ph.D. ophthalmologist with a 
background in immunology and clinical research. René 
Riickert is an M.D-MBA immunologist. Both now live in Bern. 
For more on life and careers, visit ScienceCareers.org. Send 
your story to SciCareerEditor@aaas.org. 
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